18
Pascal Kelm [email protected] Communication Systems Group Technische Universität Berlin Thursday, 04 October 2012 www.nue.tu-berlin.de

How Spatial Segmentation improves the Multimodal Geo-Tagging

Embed Size (px)

Citation preview

Page 1: How Spatial Segmentation improves the Multimodal Geo-Tagging

Pascal Kelm

[email protected]

Communication Systems Group

Technische Universität Berlin

Thursday, 04 October 2012

www.nue.tu-berlin.de

Page 2: How Spatial Segmentation improves the Multimodal Geo-Tagging

Kelm: How Spatial Segmentation improves the Multimodal Geo-

Tagging

What is meant by Spatial Segmentation?

World map is iteratively divided into segments of

different sizes

2

Page 3: How Spatial Segmentation improves the Multimodal Geo-Tagging

Kelm: How Spatial Segmentation improves the Multimodal Geo-

Tagging

Run4: only audio/visual information

Descriptions are pooled for each spatial segment (k-d tree) in

the different hierarchy level

Visual nearest neighbour search in lowest hierarchy

3

Page 4: How Spatial Segmentation improves the Multimodal Geo-Tagging

Kelm: How Spatial Segmentation improves the Multimodal Geo-

Tagging

Visual Region Model 4

Returns the visually most similar areas, which are

represented by a mean feature vector of all training images

and videos of the respective area

Page 5: How Spatial Segmentation improves the Multimodal Geo-Tagging

Kelm: How Spatial Segmentation improves the Multimodal Geo-

Tagging

Run4: Results 5

Th [km] TUB [%]UG-CU

[%]

1 0,1 0,1

10 0,1 0,7

20 0,1 0,9

50 0,1 1,1

100 0,2 2,6

200 0,8 6,9

500 4,1 14,7

1000 14,8 21,2

2000 44,5 28,5

5000 81,0 29,6

10000 98,7 91,4

15000 100,0 95,7

20000 100,0 100,0

0

10

20

30

40

50

60

70

80

90

100

1 10 20 50 100 200 500 1000 2000 5000 100001500020000

Accu

rac

y [

%]

Margin of Error [km]

TUB

UG-CU

Page 6: How Spatial Segmentation improves the Multimodal Geo-Tagging

Kelm: How Spatial Segmentation improves the Multimodal Geo-

Tagging

Run1: No additional data or gazetteers

combines textual and visual features: translation of tags and

extracted words (NLP) from the title and the description.

Porter stemmer and stop-word elimination for each segment

and granularity in the spatial segmentation.

Visual Search for the k-nearest segments in the lowest

hierarchy

6

Page 7: How Spatial Segmentation improves the Multimodal Geo-Tagging

Kelm: How Spatial Segmentation improves the Multimodal Geo-

Tagging

Term-location-distribution:

Term frequency-inverse document frequency:

7

Page 8: How Spatial Segmentation improves the Multimodal Geo-Tagging

Kelm: How Spatial Segmentation improves the Multimodal Geo-

Tagging

Example

Condence scores of the visual approach (right) restricted

to be in the most likely spatial segment determined by

the textual approach (left)

8

Page 9: How Spatial Segmentation improves the Multimodal Geo-Tagging

Kelm: How Spatial Segmentation improves the Multimodal Geo-

Tagging

Run1: Results 9

0

10

20

30

40

50

60

70

80

90

100

1 10 20 50 100 200 500 1000 2000 5000 10000 15000 20000

Accu

rac

y [

%]

Margin of Error [km]

TUB

Th [km] TUB [%]1 13,710 32,720 36,550 39,4100 41,8200 44,8500 51,71000 62,42000 76,55000 92,310000 99,415000 100,020000 100,0

Page 10: How Spatial Segmentation improves the Multimodal Geo-Tagging

Kelm: How Spatial Segmentation improves the Multimodal Geo-

Tagging

Run2: No additional data

For the highest hierarchy level the boundaries extraction using

gazetteers (GeoNames, Wikipedia and Google Maps) for the

spell checked words is added.

10

Page 11: How Spatial Segmentation improves the Multimodal Geo-Tagging

Kelm: How Spatial Segmentation improves the Multimodal Geo-

Tagging

Collaborative Systems: Example 11

這是我上次去巴黎。在那裡,我得到了我的城堡在迪斯尼樂園看。

這是我上次去巴黎。在那裡,我得到了我的城堡在迪斯尼樂園看。…

Page 12: How Spatial Segmentation improves the Multimodal Geo-Tagging

Kelm: How Spatial Segmentation improves the Multimodal Geo-

Tagging

Collaborative Systems: Example

這是我上次去巴黎。在那裡,我得到了我的城堡在迪斯尼樂園看。…

Which language is it?

Chinese

This was my last trip to Paris. I visited the castle in Disneyland…

Which words gives us information? Tags?

Trip, Paris, Castle, Disneyland

Which of these nouns have got geographical information?

Paris, Disneyland

12

Page 13: How Spatial Segmentation improves the Multimodal Geo-Tagging

Kelm: How Spatial Segmentation improves the Multimodal Geo-

Tagging

Geographical Ambiguity 13

Paris

France

Canada

Puerto Rico

Disneyland

China

USA

France

R(ci) = Rank sum

ci = Countries

N = Number of toponym

Page 14: How Spatial Segmentation improves the Multimodal Geo-Tagging

Kelm: How Spatial Segmentation improves the Multimodal Geo-

Tagging

Extracted geo. items

00001: hawaii, kauai, usa

14

hawaii

usa

kauii

Page 15: How Spatial Segmentation improves the Multimodal Geo-Tagging

Kelm: How Spatial Segmentation improves the Multimodal Geo-

Tagging

Results 15

0

10

20

30

40

50

60

70

80

90

100

1 10 20 50 100 200 500 1000 2000 5000 10000 15000 20000

Accu

rac

y [

%]

Margin of Error [km]

Run1

Run2

Run4

Page 16: How Spatial Segmentation improves the Multimodal Geo-Tagging

Kelm: How Spatial Segmentation improves the Multimodal Geo-

Tagging

Question

Thanks for your attention!

16

Page 17: How Spatial Segmentation improves the Multimodal Geo-Tagging

Kelm: How Spatial Segmentation improves the Multimodal Geo-

Tagging

Training Set: Weighting 17

Page 18: How Spatial Segmentation improves the Multimodal Geo-Tagging

Kelm: How Spatial Segmentation improves the Multimodal Geo-

Tagging

Training Set: Features 18