8
8 June 2010 1 ICEIS 2010 Pattern Recognition as a Human Centered non-Euclidean Problem ICEIS, Funchal, Madeira, 9 June 2010 Robert P.W. Duin, Delft University of Technology (In cooperation with ElŜbieta Pȩkalska, Univ. of Manchester) Pattern Recognition Lab Delft University of Technology, The Netherlands prlab.tudelft.nl 8 June 2010 2 ICEIS 2010 D = 0.3 D = 0.3 D = 1 David W. Jacobs , Daphna Weinshall and Yoram Gdalyahu , Classification with Nonmetric Distances: Image Retrieval and Class Representation, IEEE Trans. Pattern Anal. Mach. Intell, 22(6), pp. 583-600, 2000. 8 June 2010 3 ICEIS 2010 Human Judgments … … often don’t fit in an Euclidean world. D = 0.3 D = 0.3 D = 1 A B C D(A,B) < D(A,C) + D(C,B) violation of the triangle inequality 8 June 2010 4 ICEIS 2010 Learning about the world Human knowledge grows in the debate between -those who see the patterns, and -those who know the universal laws 8 June 2010 5 ICEIS 2010 Automatic Pattern Recognition Representation Generalization Sensor B B A A Can we replace our recognition of real world objects by a formal system, also when our mental system is non-Euclidean? How to train? How to apply? 8 June 2010 6 ICEIS 2010 Blob Recognition Which group? A B C

as a Human Centered non-Euclidean Problem€¦ · Pattern Recognition System Sensor Representation Generalization B A perimeter area perimeter area Feature Representation. 8 June

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: as a Human Centered non-Euclidean Problem€¦ · Pattern Recognition System Sensor Representation Generalization B A perimeter area perimeter area Feature Representation. 8 June

8 June 2010 1ICEIS 2010

Pattern Recognition as a

Human Centered non-Euclidean Problem

ICEIS, Funchal, Madeira, 9 June 2010

Robert P.W. Duin, Delft University of Technology

(In cooperation with ElŜbieta Pȩkalska, Univ. of Manchester)

Pattern Recognition Lab

Delft University of Technology, The Netherlands

prlab.tudelft.nl

8 June 2010 2ICEIS 2010

D = 0.3D = 0.3

D = 1

David W. Jacobs, Daphna Weinshall and Yoram Gdalyahu, Classification with Nonmetric Distances: Image Retrieval and Class Representation, IEEE Trans. Pattern Anal. Mach. Intell, 22(6), pp. 583-600, 2000.

8 June 2010 3ICEIS 2010

Human Judgments …

… often don’t fit in an Euclidean world.

D = 0.3 D = 0.3

D = 1A B

C

D(A,B) < D(A,C) + D(C,B)

violation of the

triangle inequality

8 June 2010 4ICEIS 2010

Learning about the world

Human knowledge grows

in the debate between

-those who see the patterns, and

-those who know the universal laws

8 June 2010 5ICEIS 2010

Automatic Pattern Recognition

RepresentationGeneralization Sensor

BB

AA

Can we replace our recognition of real world objects

by a formal system, also when our mental system isnon-Euclidean? How to train? How to apply?

8 June 2010 6ICEIS 2010

Blob Recognition

Which group?

A

B

C

Page 2: as a Human Centered non-Euclidean Problem€¦ · Pattern Recognition System Sensor Representation Generalization B A perimeter area perimeter area Feature Representation. 8 June

8 June 2010 7ICEIS 2010

Object Recognition

Airplane

Bicycle

Bus

Car

Train

8 June 2010 8ICEIS 2010

Pattern Recognition: Speech

8 June 2010 9ICEIS 2010

Experts confused by details

no. of symptoms

Diagnostic accuracy

The diagnostic classification accuracy of a

group of doctors. Sample size is 100.

F.T de Dombal, Computer-assisted diagnosis, in: Principals and practice of medical

computing, Whitby and Lutz (eds.), Churchill Livingstone, London, 1971, pp 179 - 199.

Performance �

8 June 2010 10ICEIS 2010

Performance �

Number of details �

Computers confused by details

Character recognition classification performance as a function of the number of n-tuples used.

J.R. Ullmann, Experiments with the n-tuple method of pattern recognition, IEEE Trans. on Computers, 1969, 1135-1136

8 June 2010 11ICEIS 2010

Pattern Recognition: Shapes

Examples of objects for different classes

Object of unknown class to be classified

A B?

8 June 2010 12ICEIS 2010

Pattern Recognition System

Representation GeneralizationSensor

B

A

B

A

perimeter

area

perimeter

area

Feature Representation

Page 3: as a Human Centered non-Euclidean Problem€¦ · Pattern Recognition System Sensor Representation Generalization B A perimeter area perimeter area Feature Representation. 8 June

8 June 2010 13ICEIS 2010

Measuring Human Relevant Information

A

B

AAA AAA BB B B

Nearest neighbours sorted:

8 June 2010 14ICEIS 2010

Pixel Representation

FeaturesShapes

MomentsFourier descriptors

FacesMorphology

Pixels1x

2x

16 x 16R256

Pixels are more general, initially complete representation

Large datasets are available � good results for OCR

8 June 2010 15ICEIS 2010

Peaking Phenomenon, OvertrainingCurse of Dimensionality, Rao’s Paradox

feature set size (dimensionality)

classifier complexity

training set size

Classification error

Pattern Recognition Paradox

8 June 2010 16ICEIS 2010

The Connectivity Problem in the Pixel Representation

Dependent (connected) measurements are represented independently.The dependency has to be refound from the data.

x3x2x1

Image

X1

X2

X3

Images in pixel space

8 June 2010 17ICEIS 2010

The Connectivity Problem in the Pixel Representation

Feature Space

image_1 image_3image_2

class subspace

Interpolation does not yield valid objects 8 June 2010 18ICEIS 2010

Representations

Features – details lost

Pixels – no connectivity

Dissimilarities – shape dependent

Page 4: as a Human Centered non-Euclidean Problem€¦ · Pattern Recognition System Sensor Representation Generalization B A perimeter area perimeter area Feature Representation. 8 June

8 June 2010 19ICEIS 2010

Examples Dissimilarity Measures

A B

Dist(A,B):

a ∈ A, points of Ab ∈ B, points of Bd(a,b): Euclidean distance

D(A,B) = max_a{min_b{d(a,b)}}

D(B,A) = max_b{min_a{d(b,a)}}

Hausdorff Distance (metric):

DH = max{max_a{min_b{d(a,b)}} , max_b{min_a{d(b,a)}}}

Modified Hausdorff Distance (non-metric):

DM = max{mean_a{min_b{d(a,b)}},mean_b{min_a{d(b,a)}}}

maxB

A

max

B

A

D(A,B) ≠ D(B,A)

8 June 2010 20ICEIS 2010

Examples Dissimilarity Measures

Matching new objects to various templates:

class(x) = class(argminy(D(x,y)))

Dissimilarity measure appears to be non-metric.

A.K. Jain, D. Zongker, Representation and recognition of handwritten digit using deformable templates, IEEE-PAMI, vol. 19, no. 12, 1997, 1386-1391.

8 June 2010 21ICEIS 2010

Dissimilarity Representation

=

77 76 75 74 7372 71

67 66 65 64 63 62 61

57 56 55 54 53 52 51

47 46 45 44 43 42 41

37 36 35 34 33 32 31

27 26 25 24 23 22 21

17 16 15 14 13 12 11

T

ddddddd

ddddddd

ddddddd

ddddddd

ddddddd

ddddddd

ddddddd

D

The traditional Nearest Neighbor rule (template matching) finds:label(argmintrainset{dxi}) ,

without using DT. Can we do any better?

Dissimilarities dij between

all training objects

Training set B

A

) d d d d d d d (d x7x6x5x4x3x2x1x =

Unlabeled object x to be classified

Not used by NN Rule

8 June 2010 22ICEIS 2010

Pattern Recognition System

Representation GeneralizationSensor

B

A

B

A

D(x,xA1)

D(x,x

B1)

Dissimilarity Representation

8 June 2010 23ICEIS 2010

Dissimilarities – Possible Assumptions

1. Positivity: dij ≥ 0

2. Reflexivity: dii = 0

3. Definiteness: dij = 0 objects i and j are identical

4. Symmetry: dij = dji

5. Triangle inequality: dij < dik + dkj

6. Compactness: if the objects i and j are very similar then dij < δ.

7. True representation: if dij < δ then the objects i and jare very similar.

8. Continuity of d.

Metric

8 June 2010 24ICEIS 2010

Alternatives for the Nearest Neighbor Rule

=

77 76 75 74 7372 71

67 66 65 64 63 62 61

57 56 55 54 53 52 51

47 46 45 44 43 42 41

37 36 35 34 33 32 31

27 26 25 24 23 22 21

17 16 15 14 13 12 11

T

ddddddd

ddddddd

ddddddd

ddddddd

ddddddd

ddddddd

ddddddd

D

Dissimilarities dij between

all training objects

Training set B

A

) d d d d d d d (d x7x6x5x4x3x2x1x =

Unlabeled object x to be classified

1. Dissimilarity Space

2. Embedding

Pekalska, The dissimilarity representation for PR.World Scientific, 2005.

Page 5: as a Human Centered non-Euclidean Problem€¦ · Pattern Recognition System Sensor Representation Generalization B A perimeter area perimeter area Feature Representation. 8 June

8 June 2010 25ICEIS 2010

Alternative 1: Dissimilarity Space

=

77 76 75 74 7372 71

67 66 65 64 63 62 61

57 56 55 54 53 52 51

47 46 45 44 43 42 41

37 36 35 34 33 32 31

27 26 25 24 23 22 21

17 16 15 14 13 12 11

T

ddddddd

ddddddd

ddddddd

ddddddd

ddddddd

ddddddd

ddddddd

D

) d d d d d d d (d x7x6x5x4x3x2x1x =

r1 r2 r3

Dissimilarities

Selection of 3 objects for representation

B

A

r1(d1)

r2(d4)

r3(d7)

Given labeled training set

Unlabeled object to be classified

8 June 2010 26ICEIS 2010

Dissimilarity Space Classification ���� Nearest Neighbor Rule

Modified Haussdorff distance on contours of digits

Dissimilarity based classification outperforms the nearest neighbor rule.

8 June 2010 27ICEIS 2010

Embedding

Training set

B

A � Dissimilarity matrix D � X

Is there a feature space for which Dist(X,X) = D ?

1x

2x

Position points in a vector space such that their Euclidean distances � D

8 June 2010 28ICEIS 2010

Embedding

8 June 2010 29ICEIS 2010

Euclidean - Non Euclidean - Non Metric

B

C

A

D

10

10

10

5.1

5.1

5.1

B

C

A

D

10

10

10

5.8

5.8

5.8

B

C

A

D

10

10

10

4

4

4

8 June 2010 30ICEIS 2010

Non-metric distances

14.9

7.8 4.1

object 78

object 419

object 425

Bunke’s Chicken Dataset

D(A,C)A

B

C

D(A,C) > D(A,B) + D(B,C)

D(A,B) D(B,C)

µA µB–

x

σA σB

A BC

Weighted-edit distance for strings Single-linkage clustering

2B

2A

2

BAB)J(A,σ+σµ−µ

= 0C)J(A, = largeB)J(A, =B)J(A,smallB)J(C, ≠=

Fisher criterion

Page 6: as a Human Centered non-Euclidean Problem€¦ · Pattern Recognition System Sensor Representation Generalization B A perimeter area perimeter area Feature Representation. 8 June

8 June 2010 31ICEIS 2010

(Pseudo-)Euclidean Embedding

m×m D is a given, imperfect dissimilarity matrix of training objects.

Construct inner-product matrix:

Eigenvalue Decomposition ,

Select k eigenvectors: (problem: Λk< 0)

Let ℑk be a k x k diag. matrix, ℑk(i,i) = sign(Λk(i,i))

Λk(i,i) < 0 → Pseudo-Euclidean

n×m Dz is the dissimilarity matrix between new objects and the training set.

The inner-product matrix:

The embedded objects:

JJDB (2)21−= 11m

1IJ −=TQQB Λ=

21

kkQX Λ=

)JD-J(DB )2(Tn1(2)

z21

z 11−=

kkkz21

QBZ ℑΛ= −

8 June 2010 32ICEIS 2010

PES: Pseudo-Euclidean Space (Krein Space)

If D is non-Euclidean, B has p positive and q negative eigenvalues.

A pseudo-Euclidean space ε with signature (p,q), k =p+q, is a non-degenerate inner product space ℜk = ℜp ⊕ ℜq such that:

∑∑+==

ε−=ℑ=

q

1pjjj

p

1iiipq

T yxyxyxy,x

−=ℑ

×

×

qq

pppq I0

0I

)y,x(d)y,x(dyx,yx)y,x(d 2q

2p

2 −=−−=εε

8 June 2010 33ICEIS 2010

Pseudo Euclidean Space

22ij i jd = −x x

2 22 p p q qij i j i jd = − − −x x x x

Pseudo Euclidean embedding D � {Xp,Xq}

Euclidean embedding D � X

‘Positive’ and ‘negative’ space,

Compare Minkowsky space in relativity theory

8 June 2010 34ICEIS 2010

Dissimilarity based classification procedured compared

1. Nearest Neighbour Rule

2. Reduce training set to representation set

⇒ dissimilarity space

3. Embedding:Select large Λii > 0 ⇒ Euclidean space

Select large |Λii| > 0 → pseudo-Euclidean space}

B

ATraining set

Test object x

� Dissimilarity matrix D

� Dissimilarities dx with training set

discriminant function

8 June 2010 35ICEIS 2010

Three Approaches Compared for the Zongker Data

Dissimilarity Space equivalent to Embedding better than Nearest Neighbour Rule

8 June 2010 36ICEIS 2010

Non-Euclidean Representations

• Why do we have them?

• Are they essential?

• Can we build classifiers for them?(to some extend)

• Can we transform them into Euclidean representations?(Yes, but at the cost of performance loss)

Beyond FeaturesSimilarity-based Pattern Analysis and Recognition

Page 7: as a Human Centered non-Euclidean Problem€¦ · Pattern Recognition System Sensor Representation Generalization B A perimeter area perimeter area Feature Representation. 8 June

8 June 2010 37ICEIS 2010

Computational Noise

Even for Euclidean distance matrices zero eigenvaluesmay show negative, e.g:

- X = N(50,20) : 50 points in 20 dimensions

- D = Dist(X): 50 x 50 distance matrix

- Expected: 49-20 = 29 zero eigenvalues

- Found: 15 negative eigenvalues

8 June 2010 38ICEIS 2010

Lack of information

1800:

Crossing the Jostedalsbreen was impossible.

Travelling around (200 km) lasted 5 days.

Untill the shared point X was found.

People could visit each other in 8 hours.

D(V,J) = 5 days

D(V,X) = 4 hours

D(X,J) = 4 hours

8 June 2010 39ICEIS 2010

Graph Matching ���� Dissimilarities

A

D

EB

F

C

E

D

C

BF

Representation by Connected Graphs

Graph ( Nodes, Edges, Attributes )

Distance (Graph_1, Graph_2 )

8 June 2010 40ICEIS 2010

Intrinsicly Non-Euclidean Dissimilarity MeasuresSingle Linkage

Distance(Table,Book) = 0

Distance(Table,Cup) = 0

Distance(Book,Cup) = 1

D(A,C)A

B

C

D(A,C) > D(A,B) + D(B,C)

D(A,B) D(B,C)

Single-linkage clustering

8 June 2010 41ICEIS 2010

Boundary distances

A set of boundary distances may characterize sets of datapoints:

Distances � features

8 June 2010 42ICEIS 2010

Intrinsicly Non-Euclidean Dissimilarity MeasuresMahalanobis

Pairwise comparison between

differently shaped data distributions

Different pairs � different comparison frameworks

� non-Euclidean

Page 8: as a Human Centered non-Euclidean Problem€¦ · Pattern Recognition System Sensor Representation Generalization B A perimeter area perimeter area Feature Representation. 8 June

8 June 2010 43ICEIS 2010

Intrinsicly Non-Euclidean Dissimilarity MeasuresInvariants

Object space

Non-metric object distances

due to invariants

A

B

C

D(A,C) > D(A,B) + D(B,C)

8 June 2010 44ICEIS 2010

Intrinsicly Non-Euclidean Dissimilarity Measures

Non-Euclidean human relations

8 June 2010 45ICEIS 2010

Objects may have an ‘inner life’

In dissimilarity measures the ‘inner life’ of objects may be taken into account (e.g. invariants).

� Objects cannot be represented anymore as points

� Non-Euclidean dissimilarities

8 June 2010 46ICEIS 2010

Conclusions

• Pseudo Euclidean Space (PES) is sometimes informative (corrections are not helpful).

• The corresponding problems may be intrinsic non-Euclidean

• Classifiers for non-Euclidean data have to be studied further