1
Im age Set E igenface Fisherface EGM SVM NN FRCM centerlight 53.3% 93.3% 66.7% 86.7% 73.3% 93.3% glasses 80.0% 100.0% 53.3% 86.7% 86.7% 100.0% happy 93.3% 100.0% 80.0% 100.0% 93.3% 100.0% leftlight 26.7% 26.7% 33.3% 26.7% 26.7% 33.3% noglasses 100.0% 100.0% 80.0% 100.0% 100.0% 100.0% norm al 86.7% 100.0% 86.7% 100.0% 93.3% 100.0% rightlight 26.7% 40.0% 40.0% 13.3% 26.7% 33.3% sad 86.7% 93.3% 93.3% 100.0% 93.3% 100.0% sleepy 86.7% 100.0% 73.3% 100.0% 100.0% 100.0% surprised 86.7% 66.7% 33.3% 73.3% 66.7% 86.7% wink 100.0% 100.0% 66.7% 93.3% 93.3% 100.0% A verage 75.2% 83.6% 64.2% 80.0% 77.6% 86.1% N o light 85.9% 94.8% 70.4% 93.3% 88.9% 97.8% Table 3:Yale Result Table 2:ORL Result Im age Set Eigenface Fisherface EGM SVM NN FRCM 1 92.5% 100.0% 90.0% 95.0% 92.5% 95.0% 2 85.0% 100.0% 72.5% 100.0% 95.0% 100.0% 3 87.5% 100.0% 85.0% 100.0% 95.0% 100.0% 4 90.0% 97.5% 70.0% 100.0% 92.5% 100.0% 5 85.0% 100.0% 82.5% 100.0% 95.0% 100.0% 6 87.5% 97.5% 70.0% 97.5% 92.5% 97.5% 7 82.5% 95.0% 75.0% 95.0% 95.0% 100.0% 8 92.5% 95.0% 80.0% 97.5% 90.0% 97.5% 9 90.0% 100.0% 72.5% 97.5% 90.0% 100.0% 10 85.0% 97.5% 80.0% 95.0% 92.5% 97.5% Average 87.8% 98.3% 77.8% 97.8% 93.0% 98.8% Introduction System Architecture The Approach and Experimental Results Face Processing System Based on Committee Machine: Kim-Fung Jang, Ho-Man Tang, Michael R. Lyu and Irwin K Kim-Fung Jang, Ho-Man Tang, Michael R. Lyu and Irwin K ing ing Department of Computer Science and Engineering Department of Computer Science and Engineering The Chinese University of Hong Kong, Shatin, N.T. Hong The Chinese University of Hong Kong, Shatin, N.T. Hong Kong SAR. Kong SAR. {kfjang,hmtang,lyu,king}@cse.cuhk.edu.hk {kfjang,hmtang,lyu,king}@cse.cuhk.edu.hk Department of Computer Science and Department of Computer Science and Engineering Engineering The Chinese University of Hong Kong The Chinese University of Hong Kong This paper describes a heterogeneous committee machine for face processing regarding face detection and recognition. Our proposed system consists of two components, Face Detection Committee Machine (FDCM) and Face Recognition Committee Machine (FRCM), that employs three well-known and five state-of-the-art approaches, respectively, as members. Experimental results are provided to demonstrate the improved performance of FDCM and FRCM over the individual experts. The system consists of three modules as shown in Figure 1: • Pre-processing Module employs the elliptical CrCb color model to reduce the search space for finding face candi date regions. Face Detection Module applies FDCM to verify each searc h window in the region. • Face Recognition Module recognizes the identity of the region by using FRCM. Table 1:CBCL Result True Positive False A lram R ate Rate NN SN oW SVM FDCM 10% 0.56% 0.41% 0.05% 0.02% 20% 1.37% 1.09% 0.16% 0.07% 30% 2.54% 1.67% 0.44% 0.14% 40% 4.11% 2.92% 0.83% 0.41% 50% 6.32% 4.91% 1.60% 0.77% 60% 9.47% 8.47% 3.07% 1.41% 70% 13.89% 14.67% 5.98% 3.90% 80% 26.97% 27.62% 12.32% 7.79% 90% 48.95% 49.26% 28.60% 22.92% FDCM • The FDCM works according to the Confidence T j of each expe rt i. Figure 2 shows the distribution of the confidence v alue varies in a large range. The value T i is normalized b y using the statistical information obtained from the tra ining data: where and are the mean and standard derivation of training data from expert i, respectively. The reasons for the normalization are that the confidence value is not a uniform function and that not all the expe rts have a fixed range of confidence value e.g., [-1,1] o r [0,1]. The output value of the FDCM is calculated using equation: where is the criteria factor for expert i and is the weight of the expert i. The data is classified as fac e when the value of is larger than 0 and non-face patt ern when the value is smaller than or equal to 0. i i i i T / ) ( ), * ( * i i i i i i w i i i i w i The FRCM adopts a static committee machine structure. Ea ch expert gives its Result r and Confidence c for the re sult to the voting machine. By combing them with the We ight w of each expert, recognized class is chosen with the highest Score s among j classes, which is defined as: Weight w of each expert is defined as the average perfo rmance p for the expert evaluated in cross-validation te sting in Table 2 and Table 3. The weight of expert i is normalized by: FRCM 5 1 ) ( ), ( * ) ( ) ( i i r j i c i w j s 5 1 ) exp( ) exp( ) ( i i i p p i w Result & Confident We introduce the use of Confidence c as a weighted vote for the voting machine to avoid low confidence Result r of individual expert from affecting the final result. W e adopt different approaches to find the result and con fidence. Eigenface, Fisherface and EGM: We use K nearest-neighb or classifiers. The final result for expert i is defined as the class j with the highest votes v in J classes amon g the five results: where its confidence is defined as the # of votes of th e result class divided by K. SVM: To recognize a test image in J different classes, J C 2 SVMs are constructed. The image is tested against eac h SVM and the class j with the highest votes in all SVMs is selected as the recognition result r. The confidence is defined as the number of votes of the result class d ivided by the maximum number of votes (J-1): NN: We choose a binary vector of size J for the target representation. The output value is chosen as the confi dence. 1 )) ( ( ) ( J i r v i c )) ( argmax( ) ( j v i r K i r v i c )) ( ( ) ( Figure 1: System Architecture Figure 3: ROC curves of three approaches and FDCM Figure 2: The distribution of confident value of the training data of face detection

Table 3:Yale Result Table 2:ORL Result Introduction System Architecture The Approach and Experimental Results A Face Processing System Based on Committee

Embed Size (px)

Citation preview

Page 1: Table 3:Yale Result Table 2:ORL Result Introduction System Architecture The Approach and Experimental Results A Face Processing System Based on Committee

Image Set Eigenface Fisherface EGM SVM NN FRCM

centerlight 53.3% 93.3% 66.7% 86.7% 73.3% 93.3%glasses 80.0% 100.0% 53.3% 86.7% 86.7% 100.0%happy 93.3% 100.0% 80.0% 100.0% 93.3% 100.0%leftlight 26.7% 26.7% 33.3% 26.7% 26.7% 33.3%

noglasses 100.0% 100.0% 80.0% 100.0% 100.0% 100.0%normal 86.7% 100.0% 86.7% 100.0% 93.3% 100.0%

rightlight 26.7% 40.0% 40.0% 13.3% 26.7% 33.3%sad 86.7% 93.3% 93.3% 100.0% 93.3% 100.0%

sleepy 86.7% 100.0% 73.3% 100.0% 100.0% 100.0%surprised 86.7% 66.7% 33.3% 73.3% 66.7% 86.7%

wink 100.0% 100.0% 66.7% 93.3% 93.3% 100.0%Average 75.2% 83.6% 64.2% 80.0% 77.6% 86.1%No light 85.9% 94.8% 70.4% 93.3% 88.9% 97.8%

Table 3:Yale ResultTable 2:ORL Result

Image Set Eigenface Fisherface EGM SVM NN FRCM

1 92.5% 100.0% 90.0% 95.0% 92.5% 95.0%2 85.0% 100.0% 72.5% 100.0% 95.0% 100.0%3 87.5% 100.0% 85.0% 100.0% 95.0% 100.0%4 90.0% 97.5% 70.0% 100.0% 92.5% 100.0%5 85.0% 100.0% 82.5% 100.0% 95.0% 100.0%6 87.5% 97.5% 70.0% 97.5% 92.5% 97.5%7 82.5% 95.0% 75.0% 95.0% 95.0% 100.0%8 92.5% 95.0% 80.0% 97.5% 90.0% 97.5%9 90.0% 100.0% 72.5% 97.5% 90.0% 100.0%10 85.0% 97.5% 80.0% 95.0% 92.5% 97.5%

Average 87.8% 98.3% 77.8% 97.8% 93.0% 98.8%

Introduction

System Architecture

The Approach and Experimental Results

A Face Processing System Based on Committee Machine:

Kim-Fung Jang, Ho-Man Tang, Michael R. Lyu and Irwin KingKim-Fung Jang, Ho-Man Tang, Michael R. Lyu and Irwin KingDepartment of Computer Science and EngineeringDepartment of Computer Science and Engineering

The Chinese University of Hong Kong, Shatin, N.T. Hong Kong SAR.The Chinese University of Hong Kong, Shatin, N.T. Hong Kong SAR.{kfjang,hmtang,lyu,king}@cse.cuhk.edu.hk{kfjang,hmtang,lyu,king}@cse.cuhk.edu.hk

Department of Computer Science and EngineeringDepartment of Computer Science and Engineering The Chinese University of Hong KongThe Chinese University of Hong Kong

This paper describes a heterogeneous committee machine for face processing regarding face detection and recognition. Our proposed system consists of two components, Face Detection Committee Machine (FDCM) and Face Recognition Committee Machine (FRCM), that employs three well-known and five state-of-the-art approaches, respectively, as members. Experimental results are provided to demonstrate the improved performance of FDCM and FRCM over the individual experts.

The system consists of three modules as shown in Figure 1:• Pre-processing Module employs the elliptical CrCb color model to reduce the sear

ch space for finding face candidate regions.• Face Detection Module applies FDCM to verify each search window in the region.• Face Recognition Module recognizes the identity of the region by using FRCM.

Table 1:CBCL ResultTrue Positive False Alram Rate

Rate NN SNoW SVM FDCM

10% 0.56% 0.41% 0.05% 0.02%20% 1.37% 1.09% 0.16% 0.07%30% 2.54% 1.67% 0.44% 0.14%40% 4.11% 2.92% 0.83% 0.41%50% 6.32% 4.91% 1.60% 0.77%60% 9.47% 8.47% 3.07% 1.41%70% 13.89% 14.67% 5.98% 3.90%80% 26.97% 27.62% 12.32% 7.79%90% 48.95% 49.26% 28.60% 22.92%

FDCM• The FDCM works according to the Confidence Tj of each expert i. Figure 2 shows th

e distribution of the confidence value varies in a large range. The value Ti is normalized by using the statistical information obtained from the training data:

where and are the mean and standard derivation of training data from expert i, respectively.

• The reasons for the normalization are that the confidence value is not a uniform function and that not all the experts have a fixed range of confidence value e.g., [-1,1] or [0,1].

• The output value of the FDCM is calculated using equation:

where is the criteria factor for expert i and is the weight of the expert i. The data is classified as face when the value of is larger than 0 and non-face pattern when the value is smaller than or equal to 0.

iiii T /)(

),*(* iiii

ii w

i i

i iwi

The FRCM adopts a static committee machine structure. Each expert gives its Result r and Confidence c for the result to the voting machine. By combing them with the Weight w of each expert, recognized class is chosen with the highest Score s among j classes, which is defined as:

Weight w of each expert is defined as the average performance p for the expert evaluated in cross-validation testing in Table 2 and Table 3. The weight of expert i is normalized by:

FRCM

5

1

)(),(*)()(i

irjiciwjs

5

1)exp(

)exp()(

i i

i

p

piw

Result & ConfidentWe introduce the use of Confidence c as a weighted vote for the voting machine to avoid low confidence Result r of individual expert from affecting the final result. We adopt different approaches to find the result and confidence.

• Eigenface, Fisherface and EGM: We use K nearest-neighbor classifiers. The final result for expert i is defined as the class j with the highest votes v in J classes among the five results:

where its confidence is defined as the # of votes of the result class divided by K.

• SVM: To recognize a test image in J different classes, JC2 SVMs are constructed. The image is tested against each SVM and the class j with the highest votes in all SVMs is selected as the recognition result r. The confidence is defined as the number of votes of the result class divided by the maximum number of votes (J-1):

• NN: We choose a binary vector of size J for the target representation. The output value is chosen as the confidence.

1

))(()(

J

irvic

))(argmax()( jvir

K

irvic

))(()(

Figure 1: System Architecture

Figure 3: ROC curves of three approaches and FDCM

Figure 2: The distribution of confident value of the training data of face

detection