C OMPUTATION M ODEL FOR V ISUAL C ATEGORIZATION Bhuwan Dhingra

COMPUTATION MODEL FOR VISUAL CATEGORIZATIONBhuwan Dhingra

OVERVIEW

Objective: To study the hierarchy of object categorization using a computational model for vision.

Three levels of categorization – super-ordinate, basic and subordinate.

Basic level categories – maximize cue validities, and dominate any taxonomy.

Categorization implemented in unsupervised manner in the current model.

HYPOTHESES

Rosch et al, [1], claim that basic level categories accessed first.

Marc and Joubert, [2], claim that in a purely visual task super-ordinate categories accessed first.

Role of expertise emphasized several times in the literature, [3].

THE MODEL

Bag-of-Features:

THE MODEL

Extracted histograms clustered in an unsupervised manner using k-means algorithm.

Distance metric used – (1-correlation(h1,h2)), where h1 and h2 are two histograms.

DATASET

30 images for each subordinate category using Google image search of the keywords.

DATASET

FurnitureAnimal

TableChairBirdDog

Coffee Table

Picnic Table

Rocking Chair

Bar-stool

Pigeon

Foxhound

Dalmation

Super-ordinate classes

Basic classes

Sub-ordinate classes

Test 1: Study which type of categorization dominates as the number of detected key-points is varied.

Test 2: Study how the performance of the categorization changes with the number of images.

Test 3: Study the effect of increasing the number of images of one basic category compared to others

Different categorizations were implemented by setting k = 2,4,8.

PERFORMANCE INDICES

Rand Index:

TP, TN, FP, FN are true positive and negatives, and false positives and negatives.

Purity: Percentage of correctly assigned points, assuming majority class for each cluster.

Normalized Mutual Information: Information theoretic mutual information between clusters and classes (normalized to 1).

Silhouette Index: Based on the ratio of the within class scatter to between class scatter.

RESULTS Variation of the performance metrics with

Peak Threshold or the number of key-points detected.

-0.01 0 0.01 0.02 0.03 0.04 0.05 0.06 0.070.35

Peak Threshold

Purity vs Peak Threshold

Super-ordinateBasicSub-ordinate

-0.01 0 0.01 0.02 0.03 0.04 0.05 0.06 0.070.02

Peak Threshold

Silhouette Index vs Peak Threshold

-0.01 0 0.01 0.02 0.03 0.04 0.05 0.06 0.070.05

Peak Threshold

Rand Index vs Peak Threshold

-0.01 0 0.01 0.02 0.03 0.04 0.05 0.06 0.070

Peak Threshold

NMI vs Peak Threshold

-0.01 0 0.01 0.02 0.03 0.04 0.05 0.06 0.070.35

Peak Threshold

-0.01 0 0.01 0.02 0.03 0.04 0.05 0.06 0.070.02

Peak Threshold

-0.01 0 0.01 0.02 0.03 0.04 0.05 0.06 0.070.05

Peak Threshold

-0.01 0 0.01 0.02 0.03 0.04 0.05 0.06 0.070

Peak Threshold

-0.01 0 0.01 0.02 0.03 0.04 0.05 0.06 0.070.35

Peak Threshold

-0.01 0 0.01 0.02 0.03 0.04 0.05 0.06 0.070.02

Peak Threshold

-0.01 0 0.01 0.02 0.03 0.04 0.05 0.06 0.070.05

Peak Threshold

-0.01 0 0.01 0.02 0.03 0.04 0.05 0.06 0.070

Peak Threshold

RESULTS Variation of performance metrics with

number of images:

10 15 20 25 30

Images per sub-ordinate category

NMI vs Number of Images

10 15 20 25 300.4

Purity vs Number of Images

10 15 20 25 30

Rand Index vs Number of Images

10 15 20 25 300.065

Silhoutte Index vs Number of Images

RESULTS Effect of expertise Two subordinate and one basic level categories

taken together, ex: {{dalmation, foxhound}, bird} Trial 1: Training samples of subordinate categories

half of basic categoryTrial 2: Training samples of subordinate category equal to basic category

30 600

Number of images in Basic Category

Effect of Expertise

dogbirdchairtable

SOME PROBLEMS

White background images sometimes classified separate from cluttered background. Solution: Foreground extraction

High variability in Normalized Mutual Information (NMI)

Effect of expertise not clear Solution: Test for exponential increase in

images

REFERENCES

[1] Rosch, E., Mervis, C., Gray, W., Johnson, D., & Boyes-Braem, P. (1976). Basic objects in natural categories. Cognitive Psychology.

[2] Marc, J.M.M., Joubert, O.R., Nespoulous, J.L. & Fabre-Thorpe, M (2009). The time-course of visual categorizations: you spot the animal faster than the bird. PLoS one.

[3] Johnson, K.E., Mervis, C.B. (1997). Effects of varying levels of expertise on the basic level of categorization. Journal of Expert Psychology.

C OMPUTATION M ODEL FOR V ISUAL C ATEGORIZATION Bhuwan Dhingra

Documents

Infusing technology in the v isual arts classroom

Bhuwan Tequila 2

6. Electrical - Ijeeer - An Investigation on Use of Power System - Bhuwan Singh

A UTOMATIC C OMPUTATION OF CDR U SING F UZZY C LUSTERING T ECHNIQUES

Secure Object-based Coding v isual Privacy protection solution

When Cuing V isual Search, Sometimes M ore Is Less

Pr edicting the Unobser vable V isual 3D T racking with a ...nah/bibtex/papers/morwald2011icra.pdf · Pr edicting the Unobser vable V isual 3D T racking with a Probabilistic Motion

Statistics, Time Series, omputation Finance, erivative ...ronnnwu.github.io/old_version/note/pdf/finance/computeFinance.pdf · 1 Statistics, Time Series, omputation Finance, erivative

C at V isual D isse ction G uide - VWR International

ISUAL System Design H. Heetderks. PDR 31 August 2000NCKU UCB Tohoku ISUAL System Design H. Heetderks…

V isual Slope V6 · V isual Slope V6. ii ... Visual Slope is developed based on the most commonly accepted design theories, ... The Visual Slope seepage analysis module is capable

Visual Basic 4.0 V Microsoft isual Basicalumni.media.mit.edu/~arnans/resources/pdf/vb.pdf · isual Basic ⁄ÙŁ`×˝»ˆ—¡˝”¡Òˆ˝”ˆ` ÀÒ⁄˙Ô“Ò˙Ô¨˙¡ˆˆ`⁄˝`¾Ô˙àµ˝ˆì

Bhuwan Pant

ISUAL Data Formats & Science Data Processing S. Geller

C OMPLEXITY - THEORETIC F OUNDATIONS OF S TEGANOGRAPHY AND C OVERT C OMPUTATION Daniel Apon

ISUAL Instrument Software S. Geller. CDR July, 2001NCKU UCB Tohoku ISUAL Instrument Software S. Geller 2 Topics Presented Software Functions SOH Telemetry

Bhuwan Deepak

DIS tributed CO ntent-based V isual I nformation R etrieval

V isual field

Bhuwan nrhm