Computer Vision Group University of California Berkeley Visual Grouping and Object Recognition...

Computer Vision GroupUniversity of California Berkeley

Visual Grouping and Object Recognition

Jitendra Malik*

U.C. Berkeley

* with S. Belongie, C. Fowlkes, T. Leung, D. Martin, G. Mori, J. Puzicha, J.Shi, X. Ren

From images/video to objects

Labeled sets: tiger, grass etc

Consistency

• A,C are refinements of B• A,C are mutual refinements • A,B,C represent the same percept

• Attention accounts for differences

BG L-bird R-bird

grass bush

headeye

beakfar body

headeye

beak body

Perceptual organization forms a tree:

Two segmentations are consistent when they can beexplained by the samesegmentation tree (i.e. theycould be derived from a single perceptual organization).

Outline

• Finding boundaries

• Recognizing objects

• Recognizing actions

Finding boundaries: Is texture a problem or a solution?

image orientation energy

Statistically optimal contour detection

• Use humans to segment a large collection of natural images.

• Train a classifier for the contour/non-contour classification using orientation energy and texture gradient as features.

Orientation Energy

• Gaussian 2nd derivative and its Hilbert pair

• Can detect combination of bar and edge features [Perona & Malik 90]

22 )()( evenodd fIfIOE

Texture gradient = Chi square distance between texton histograms in half disks across edge

jiji mhmh

mhmhhh

)]()([

1),(Chi-square

ROC curve for local boundary detection

Outline

Biological Shape

• D’Arcy Thompson: On Growth and Form, 1917– studied transformations between shapes of organisms

Deformable Templates: Related Work

• Fischler & Elschlager (1973)

• Grenander et al. (1991)

• von der Malsburg (1993)

Matching Framework

• Find correspondences between points on shape

• Fast pruning

• Estimate transformation & measure similarity

model target

Comparing Pointsets

Shape ContextCount the number of points inside each bin, e.g.:

Count = 4

Count = 10

Compact representation of distribution of points relative to each point

Shape Context

Comparing Shape Contexts

Compute matching costs using Chi Squared distance:

Recover correspondences by solving linear assignment problem with costs Cij

[Jonker & Volgenant 1987]

Matching Framework

• Fast pruning

model target

Fast pruning

• Find best match for the shape context at only a few random points and add up cost

),(minarg

),(),(

jqueryui

jiquery

SCSCSC

SCSCSSdist

Matching Framework

• Fast pruning

model target

• 2D counterpart to cubic spline:

• Minimizes bending energy:

• Solve by inverting linear system

• Can be regularized when data is inexact

Thin Plate Spline Model

Duchon (1977), Meinguet (1979), Wahba (1991)

MatchingExample

model target

Outlier Test Example

Object Recognition Experiments

• Handwritten digits

• COIL 3D objects (Nayar-Murase)

• Human body configurations

• Trademarks

Terms in Similarity Score• Shape Context difference

• Local Image appearance difference– orientation– gray-level correlation in Gaussian window– … (many more possible)

• Bending energy

Handwritten Digit Recognition

• MNIST 60 000: – linear: 12.0%

– 40 PCA+ quad: 3.3%

– 1000 RBF +linear: 3.6%

– K-NN: 5%

– K-NN (deskewed): 2.4%

– K-NN (tangent dist.): 1.1%

– SVM: 1.1%

– LeNet 5: 0.95%

• MNIST 600 000 (distortions): – LeNet 5: 0.8%– SVM: 0.8%– Boosted LeNet 4: 0.7%

• MNIST 20 000: – K-NN, Shape Context

matching: 0.63%

COIL Object Database

Prototypes Selected for 2 Categories

Details in Belongie, Malik & Puzicha (NIPS2000)

Error vs. Number of Views

Human body configurations

Deformable Matching

• Kinematic chain-based deformation model

• Use iterations of correspondence and deformation

• Keypoints on exemplars are deformed to locations on query image

Results

Trademark Similarity

Recognizing objects in scenes

Outline

Examples of Actions• Movement and posture change

– run, walk, crawl, jump, hop, swim, skate, sit, stand, kneel, lie, dance (various), …

• Object manipulation– pick, carry, hold, lift, throw, catch, push, pull, write, type, touch, hit,

press, stroke, shake, stir, turn, eat, drink, cut, stab, kick, point, drive, bike, insert, extract, juggle, play musical instrument (various)…

• Conversational gesture– point, …

• Sign Language

Key cues for action recognition

• “Morpho-kinesics” of action (shape and movement of the body)

• Identity of the object/s

• Activity context

Image/Video Stick figure Action

• Stick figures can be specified in a variety of ways or at various resolutions (deg of freedom)– 2D joint positions– 3D joint positions– Joint angles

• Complete representation

• Evidence that it is effectively computable

Tracking by Repeated Finding

Achievable goals in 3 years

• Reasonable competence at object recognition at crude category level (~1000)

• Detection/Tracking of humans as kinematic chains, assuming adequate resolution.

• Recognition of ~10-100 actions and compositions thereof.

Computer Vision Group University of California Berkeley Visual Grouping and Object Recognition...

Documents

Convolutional Networks with Adaptive Computation Graphs...Convolutional Networks with Adaptive Inference Graphs Andreas Veit Serge Belongie Department of Computer Science & Cornell

RecommendationswithFeedback - Berkeley Haasfaculty.haas.berkeley.edu/manso/rf.pdf · RecommendationswithFeedback∗ GaneshIyer UniversityofCalifornia,Berkeley GustavoManso UniversityofCalifornia,Berkeley

A Database of Human Segmented Natural Images and Two Applications David Martin, Charless Fowlkes, Doron Tal, Jitendra Malik UC Berkeley {dmartin,fowlkes,doron,malik}@eecs.berkeley.edu

Computer Vision Group University of California Berkeley 1 Learning Scale-Invariant Contour Completion Xiaofeng Ren, Charless Fowlkes and Jitendra Malik

BERKELEY HEIGHTS PUBLIC SCHOOLS BERKELEY HEIGHTS, …

Monitoring Creatures Great and Small: Computer Vision Systems …mori/research/papers/mori_animal... · 2019. 7. 31. · VS-PETS , 2005. [5]C. Fowlkes, S. Belongie, F. Chung, and

Objects in Context - Carnegie Mellon School of … · Analysis: Objects in Context [Rabinovich, Vedaldi, Galleguillos, Wiewiora, Belongie] & Object Categorization using Co-Occurrence,

Model–basedHalftoning for Color Image Segmentationcseweb.ucsd.edu/~sjb/icpr00.pdfModel–basedHalftoning for Color Image Segmentation Jan Puzicha and Serge Belongie UC Berkeley,

Alida Harper Fowlkes papers

Boris Babenko, Steve Branson, Serge Belongie University of California, San Diego ICCV 2009, Kyoto, Japan

Downtown Berkeley Development Feasibility StudyCity of Berkeley City Council Meeting Downtown Berkeley Development Feasibility Study City of Berkeley City

Cue Integration in Figure/Ground Labeling Xiaofeng Ren, Charless Fowlkes and Jitendra Malik, U.C. Berkeley We present a model of edge and region grouping

Matching Shapes - EECS at UC Berkeley · PDF fileEighth IEEE International Conference on Computer Vision (July 2001) Matching Shapes Serge Belongie, Jitendra Malik and Jan Puzicha

Layered Object Detection for Multi-Class Image Segmentation UC Irvine Yi Yang Sam Hallman Deva Ramanan Charless Fowlkes

Fowlkes Hazmat Presentation

Charless C. Fowlkesfowlkes/cv.pdf · 2019. 12. 16. · Charless C. Fowlkes Contact Information 4076 Donald Bren Hall 949.824.6945 University of California, Irvine 92697 fowlkes@ics.uci.edu

ADS lab NCKU1 Michael Maire, Pablo Arbelaez, Charless Fowlkes, and Jitendra Malik university of California, Berkeley – Berkeley university of California,

Antón R. Escobedo cse 252c Behavior Recognition via Sparse Spatio-Temporal Features Piotr Dollár Vincent Rabaud Garrison CottrellSerge Belongie

Carolina Galleguillos and Serge Belongie Department of Computer Science and Engineering, UCSD {cgallegu,sjb}@cs.ucsd.edu Grocery shopping is a common activity

Computer Vision Group University of California Berkeley Matching Shapes Serge Belongie , Jitendra Malik and Jan Puzicha U.C. Berkeley Present address: