22
UNCW REU FINAL PRESENTATION BY CATHERINE NANSALO 1

UNCW REU Final PResentationpeople.uncw.edu/chenc/STT592_Deep Learning/2017 NSF-REU-files/F… · DATASET Morph II 55,134 mugshots of 13,617 individuals collected over 5 years. 1 and

  • Upload
    others

  • View
    5

  • Download
    0

Embed Size (px)

Citation preview

Page 1: UNCW REU Final PResentationpeople.uncw.edu/chenc/STT592_Deep Learning/2017 NSF-REU-files/F… · DATASET Morph II 55,134 mugshots of 13,617 individuals collected over 5 years. 1 and

UNCW REU FINAL PRESENTATIONBY CATHERINE NANSALO

1

Page 2: UNCW REU Final PResentationpeople.uncw.edu/chenc/STT592_Deep Learning/2017 NSF-REU-files/F… · DATASET Morph II 55,134 mugshots of 13,617 individuals collected over 5 years. 1 and

2

Outlin

e• Introduction

• Background

• Feature extraction technics

• Dimension reduction technics

• Methods

• Dataset

• Tuning Techniques

• Results

• Conclusions

• speculations

• Future Plans

Page 3: UNCW REU Final PResentationpeople.uncw.edu/chenc/STT592_Deep Learning/2017 NSF-REU-files/F… · DATASET Morph II 55,134 mugshots of 13,617 individuals collected over 5 years. 1 and

INTRODUCTION

ClassificationDimension Reduction

Feature extraction

imagevectors accuracies

vectors accuracies

3

With application from security to

entertainment, race classification

has been a growing field of study.

Before preforming race

classification on a facial image, the

first two steps are feature extraction

and dimension reduction.

Page 4: UNCW REU Final PResentationpeople.uncw.edu/chenc/STT592_Deep Learning/2017 NSF-REU-files/F… · DATASET Morph II 55,134 mugshots of 13,617 individuals collected over 5 years. 1 and

BACKGROUND

4

Page 5: UNCW REU Final PResentationpeople.uncw.edu/chenc/STT592_Deep Learning/2017 NSF-REU-files/F… · DATASET Morph II 55,134 mugshots of 13,617 individuals collected over 5 years. 1 and

FEATURE EXTRACTION

Feature extraction starts from an

initial set of raw data and builds

derived values, or features, intended

to be more informative than the raw

data. In this case the raw data is the

input image.

5

http://www.kdnuggets.com/2016/06/doing-data-science-kaggle-walkthrough-data-transformation-feature-extraction.html/2

Page 6: UNCW REU Final PResentationpeople.uncw.edu/chenc/STT592_Deep Learning/2017 NSF-REU-files/F… · DATASET Morph II 55,134 mugshots of 13,617 individuals collected over 5 years. 1 and

LOCAL BINARY PATTERN

• Comparing each pixel with its neighborhood

creating LBPs

• Compiling LBPs in to Histograms

• Concatenating histograms of each block to

create feature vectors foe the image

6

https://www.ijser.org/paper/Weighted-Local-Active-Pixel-Pattern-WLAPP-for-effective-Face-Recognition.html

Page 7: UNCW REU Final PResentationpeople.uncw.edu/chenc/STT592_Deep Learning/2017 NSF-REU-files/F… · DATASET Morph II 55,134 mugshots of 13,617 individuals collected over 5 years. 1 and

HISTOGRAMS OF ORIENTED GRADIENT

Count occurrences of

gradient orientation in

localized portion of an image

7

Orientation histogram are

concatenated to construct

the final features vector.

https://www.researchgate.net/figure/268214334_fig3_Fig-4-HOG-extraction-features-representation-Image-is-divided-in-cells-Each-cell-is

Page 8: UNCW REU Final PResentationpeople.uncw.edu/chenc/STT592_Deep Learning/2017 NSF-REU-files/F… · DATASET Morph II 55,134 mugshots of 13,617 individuals collected over 5 years. 1 and

BIO-INSPIRED FEATURES

Bio-Inspired features are based

on a recent theory of the

feedforward path of object

recognition in visual cortex

which accounts for the first

100-200 milliseconds of

processing in the ventral stream

of primate visual cortex

8

Band 1

Band 2

Image

Page 9: UNCW REU Final PResentationpeople.uncw.edu/chenc/STT592_Deep Learning/2017 NSF-REU-files/F… · DATASET Morph II 55,134 mugshots of 13,617 individuals collected over 5 years. 1 and

DIMENSIONALITY REDUCTION

9

Sometimes, features are

correlated, and hence redundant.

Dimension reduction is used to

reduce the number of features

while preserving the information

they provide.

http://www.geeksforgeeks.org/dimensionality-reduction/

Page 10: UNCW REU Final PResentationpeople.uncw.edu/chenc/STT592_Deep Learning/2017 NSF-REU-files/F… · DATASET Morph II 55,134 mugshots of 13,617 individuals collected over 5 years. 1 and

PRINCIPAL COMPONENT ANALYSIS

10

Orthogonal transformation to convert a set of

observations of possibly correlated variables into

a set of values of linearly uncorrelated variables

called principal components

They are the directions where there is the most

variance, the directions where the data is most

spread out.

Page 11: UNCW REU Final PResentationpeople.uncw.edu/chenc/STT592_Deep Learning/2017 NSF-REU-files/F… · DATASET Morph II 55,134 mugshots of 13,617 individuals collected over 5 years. 1 and

LINEAR DISCRIMINATE ANALYSIS

11

In in addition to finding the component axes that

maximize the variance of our data (PCA), we are

additionally interested in the axes that maximize

the separation between multiple classes (LDA).

So, in a nutshell, often the goal of an LDA is to

project a feature space (a dataset n-dimensional

samples) onto a smaller

subspace k (where k≤n−1k≤n−1) while maintaining

the class-discriminatory information.

http://sebastianraschka.com/Articles/2014_python_lda.html

Page 12: UNCW REU Final PResentationpeople.uncw.edu/chenc/STT592_Deep Learning/2017 NSF-REU-files/F… · DATASET Morph II 55,134 mugshots of 13,617 individuals collected over 5 years. 1 and

METHODS

12

Page 13: UNCW REU Final PResentationpeople.uncw.edu/chenc/STT592_Deep Learning/2017 NSF-REU-files/F… · DATASET Morph II 55,134 mugshots of 13,617 individuals collected over 5 years. 1 and

DATASET

Morph II

55,134 mugshots of 13,617 individuals collected

over 5 years. 1 and 53 pictures per person , with the

average number of pictures being 4

race, gender, and date of birth, date of arrest, age, and

age difference since last picture, subject identifier, and

picture number for each picture in the database.

Preprocessing

The images were rotated and cropped by the

positions of the eye center, nose and mouth. These

images are then converted to gray scale and reside to

60x70.

13

Page 14: UNCW REU Final PResentationpeople.uncw.edu/chenc/STT592_Deep Learning/2017 NSF-REU-files/F… · DATASET Morph II 55,134 mugshots of 13,617 individuals collected over 5 years. 1 and

SUBSETS

14

To have testing and training groups (S1,

S2), we create subset. With 10280

pictures each, S1 and S2 have

equivalent age distributions to Morph II

while maintaining independence

between subsets. The sub-sets have a 1

to 1 ratio of black to white, and a 1 to

3 ratio of females to males.

Morp

h II S1

S2

Remainder

Page 15: UNCW REU Final PResentationpeople.uncw.edu/chenc/STT592_Deep Learning/2017 NSF-REU-files/F… · DATASET Morph II 55,134 mugshots of 13,617 individuals collected over 5 years. 1 and

CLASSIFICATION

Support Vector Machines with radial basis

function.

The algorithm finds a hyperplane that

separates the different classes of the data.

For each test optimal values of gamma and

cost were found by using a multi-

parameter grid search.

15

Page 16: UNCW REU Final PResentationpeople.uncw.edu/chenc/STT592_Deep Learning/2017 NSF-REU-files/F… · DATASET Morph II 55,134 mugshots of 13,617 individuals collected over 5 years. 1 and

TECHNIQUES - FEATURE EXTRACTION TUNING

LBPs have two main parameters to tune: radius and block size. Combination

of radius=1, 2, 3, and block size=10, 12, 14, 16, 18, 20 were tested.

16

Page 17: UNCW REU Final PResentationpeople.uncw.edu/chenc/STT592_Deep Learning/2017 NSF-REU-files/F… · DATASET Morph II 55,134 mugshots of 13,617 individuals collected over 5 years. 1 and

TECHNIQUES - FEATURE EXTRACTION TUNING

17

BIFs have two main parameters to tune: gamma and block size. Combinations

of gamma=0.1, 0.2, ..., 1.0, and block sizes=15-29, 7-37 were tested.

Page 18: UNCW REU Final PResentationpeople.uncw.edu/chenc/STT592_Deep Learning/2017 NSF-REU-files/F… · DATASET Morph II 55,134 mugshots of 13,617 individuals collected over 5 years. 1 and

TECHNIQUES - FEATURE EXTRACTION TUNING

18

HOGs have two main parameters to tune: number of orientations and

block size. Combinations of orientations=4, 6, 8, and block size=4, 6, 8,

10, 12, 14 where tested.

Page 19: UNCW REU Final PResentationpeople.uncw.edu/chenc/STT592_Deep Learning/2017 NSF-REU-files/F… · DATASET Morph II 55,134 mugshots of 13,617 individuals collected over 5 years. 1 and

RESULTS

19

The table shows the accuracy rates for

race classification the combined methods

produced on the subsetted Morph II data

set. The combination of LBP for extraction

and LDA for reduction produces the

highest accuracy rate, but this is only .7%

better than the combination of BIF and

LDA. HOG extraction produces the

lowest accuracy of 49% where the

algorithm simple predicated the same race

for all images. The run time in seconds is

included for each test.

Page 20: UNCW REU Final PResentationpeople.uncw.edu/chenc/STT592_Deep Learning/2017 NSF-REU-files/F… · DATASET Morph II 55,134 mugshots of 13,617 individuals collected over 5 years. 1 and

CONCLUSION

It is clear from the results that HOG is over

all an ineffective feature extraction method

when classifying race. LBP and BIF on the

other hand both provide high accuracy rates

fro classification. The difference in accuracy

between PCA and LDA for dimension

reduction are very small (1.9% and 1%) and

may be insignificant. However the computation

time for PCA is about half that for LDA with

both LBP and BIF. About the same race

classification accuracy can be achieved in half

the time when using PCA over LDA for

dimension reduction.

20

Page 21: UNCW REU Final PResentationpeople.uncw.edu/chenc/STT592_Deep Learning/2017 NSF-REU-files/F… · DATASET Morph II 55,134 mugshots of 13,617 individuals collected over 5 years. 1 and

FUTURE WORK

Future work could explore more

methods for feature extracting and

dimension reduction

Or, could explore gender

classification.

21

Page 22: UNCW REU Final PResentationpeople.uncw.edu/chenc/STT592_Deep Learning/2017 NSF-REU-files/F… · DATASET Morph II 55,134 mugshots of 13,617 individuals collected over 5 years. 1 and

22

REFRENCES[1] Carcagnì, P., Del Coco, M., Mazzeo, P. L., Testa, A., & Distante, C. (2014, August). Features descriptors for demographic

estimation: a comparative study. In International Workshop on Video Analytics for Audience Measurement in Retail and Digital

Signage (pp. 66-85). Springer, Cham

[2] Guo, Z., Zhang, L., & Zhang, D. (2010). A completed modeling of local binary pattern operator for texture

classification. IEEE Transactions on Image Processing, 19(6), 1657-1663.

[3] Tsai, G. (2010). Histogram of oriented gradients. University of Michigan.

[4] Serre, T., Wolf, L., Bileschi, S., Riesenhuber, M., & Poggio, T. (2007). Robust object recognition with cortex-like

mechanisms. IEEE transactions on pattern analysis and machine intelligence, 29(3), 411-426.

[5] Abdi, H., & Williams, L. J. (2010). Principal component analysis. Wiley interdisciplinary reviews: computational

statistics, 2(4), 433-459.

[6] Chen, G., Florero-Salinas, W., & Li, D. (2017, May). Simple, fast and accurate hyper-parameter tuning in Gaussian-kernel

SVM. In Neural Networks (IJCNN), 2017 International Joint Conference on (pp. 348-355). IEEE.

[7] Raschka,S. (2014,August). Linear Discriminat Analysis-Bit by Bit. Sebastianraschka

http://sebastianraschka.com/Articles/2014_python_lda.html

[8] Karl Ricanek and Tamirat Tesafaye. Morph: A longitudinal image database of normal adult age-progression. In

Automatic Face and Gesture Recognition, 2006. FGR 2006. 7th International Conference on, pages 341–345. IEEE, 2006.

ACKNOWLEDGMENT

This research was supported by NSF, DMS grant number 1659288. Special thanks goes out to Dr. Cuixian Chen, Dr. Yishi

Wang and Troy Kling