24
1 Indexing Large Trajectory Data Sets With SETI V.Prasad Chakka Adam C.Everspaugh Jignesh M.Patel University of Michigan Presented by Guangyue Jia

Indexing Large Trajectory Data Sets With SETI

  • Upload
    baylee

  • View
    48

  • Download
    0

Embed Size (px)

DESCRIPTION

Indexing Large Trajectory Data Sets With SETI. V.Prasad Chakka Adam C.Everspaugh Jignesh M.Patel University of Michigan Presented by Guangyue Jia. Overview. Motivation Problem definition and query types SETI Experimental Evaluation Strong and weak points - PowerPoint PPT Presentation

Citation preview

Page 1: Indexing Large Trajectory Data Sets With SETI

1

Indexing Large Trajectory Data Sets With SETI

V.Prasad Chakka Adam C.Everspaugh Jignesh M.Patel

University of Michigan

Presented byGuangyue Jia

Page 2: Indexing Large Trajectory Data Sets With SETI

2

Overview

• Motivation

• Problem definition and query types

• SETI

• Experimental Evaluation

• Strong and weak points

• Relation and stimulation to our project

• Conclusion

Page 3: Indexing Large Trajectory Data Sets With SETI

3

Motivation• Location based systems are used everywhere.

– Existing LBS: GPS, Navigation systems, Others– How many cars were in the center of Aalborg from 10

to 11 o´clock.

• Efficient and Inexpensive techniques. Previous Indices: B-tree, R-tree, Others

• New methord . SETI—Scalable and Efficient Trajectory Index

Page 4: Indexing Large Trajectory Data Sets With SETI

4

Overview

• Motivation

• Problem definition and query types

• SETI

• Experimental Evaluation

• Strong and weak points

• Relation and stimulation to our project

• Conclusion

Page 5: Indexing Large Trajectory Data Sets With SETI

5

Problem definition and query types

• Data model– Trajectory is represented as

trj (tid, <U0, U1, U2, ......Un, ......>).

– Segment is represented as

si (tid, sid, ui-1, ui).

– Point u is a three-tuple

ui (xi, yi, ti)

Page 6: Indexing Large Trajectory Data Sets With SETI

6

Problem definition and query types

• Query types– Queries that ask questions about the future

positions of moving objects.• Where is car A after one hour?• answered by storing current position, speed and

the direction of the moving objects.

– Queries that ask questions about the historical positions of moving objects.

• time interval query • time slice query• nearest neighbor query• Where is car A at 5pm, yesterday?

Page 7: Indexing Large Trajectory Data Sets With SETI

7

Overview

• Motivation

• Problem definition and query types

• SETI

• Experimental Evaluation

• Strong and weak points

• Relation and stimulation to our project

• Conclusion

Page 8: Indexing Large Trajectory Data Sets With SETI

8

SETI

• Description

• Insert

• Example of the Insert Procedure

• Search

• Deletes and Updates

Page 9: Indexing Large Trajectory Data Sets With SETI

9

SETI-Description

• Is a logical indexing structure that built on top of an existing spatial indexing techniques.– R-tree.

• Partition + temporal indices.– Abandon 3D indexing technology.– Partition the 2D spatial data.– Index lines in 1D(time) dimension.

• Data page– Each data page only contains segments that belong to the same

spatial cell.– Lifetime of the data page.

• Use of multiple sparse indices.– One entry for each data page.

Page 10: Indexing Large Trajectory Data Sets With SETI

10

SETI-Insert-is a cache-Maintains the last updated location-pull out the last known location-updated with the new location

-Determines the particular spatial cells-split segments which span multiple spatial cells

Page 11: Indexing Large Trajectory Data Sets With SETI

11

SETI-Example of the Insert ProcedureDescription:

-A is the current location of O.

-O move from A to A´.

-AA´ represent the movement of O between the two updates.

Procedure:

1, A´ are sent to insert module.

2, Front Line receive A´, pull out A, update by A´, and send AA´to Partitioning Module.

3, Partitioning Module receive AA´, and determine the spatial cells for AA´, and also break AA´ if it spans multiple cells.

4, Update the temporal indices and Data File

Page 12: Indexing Large Trajectory Data Sets With SETI

12

SETI-Example of the Insert Procedure-AA´spans two spatial cells.

-AA´ is broken into two smaller segments: AX and XA´.

-X is the intersection point.

-X is a logical update location.

-AX and XA´are inserted into the spatial cells.

-AX and XA still represent the single segment AA´.

-Also need calculate the time of point X.

Page 13: Indexing Large Trajectory Data Sets With SETI

13

SETI-SearchSpatial Filtering:

produce candidate cells

Temporal Filtering:

probe temporal indices in the candidate cells.

Refinement Step:

if page completely inside the spatial predicate box.

then

if the temporal predicate range contains the page lifetime

then select all segments on the page

else apply query on each segments

else apply query on each segments

Duplicate Elimination:

use bitmap

Page 14: Indexing Large Trajectory Data Sets With SETI

14

SETI-Deletes and Updates

• Deletion types:– Delete particular segment– Delete complete trajectory

• Segment deletion– Use bounding box

• Complete trajectory deletion– All the segments of the trajectory must be identified.– Use an auxiliary composite B+-tree index the

trajectory ID and the segment number of the trajectory.

• Updates– Deletion+Insertion

Page 15: Indexing Large Trajectory Data Sets With SETI

15

Overview

• Motivation

• Problem definition and query types

• SETI

• Experimental Evaluation

• Strong and weak points

• Relation and stimulation to our project

• Conclusion

Page 16: Indexing Large Trajectory Data Sets With SETI

16

Experimental Evaluation

• Experimental platform and software– Intel Pentium III 600MHz, 384MB main memory, 60GB IBM

Deskstar 7200 RPM ULtra ATA/100 disk, Debian Linux version 2.4.13

– Software is a system called COMET.

• Data Sets– GSTD– Net work data

• Queries– Time interval query: Equal normalized widths 3D box.– Time slice query: time stamp value and 2D spatial range.

Page 17: Indexing Large Trajectory Data Sets With SETI

17

Experimental Evaluation

Effect of Number of spatial Partitioning Cells, GSTD(1K, 4M), 0.1% Time-interval Query

Index Sizes, GSTD(1K, X)

Page 18: Indexing Large Trajectory Data Sets With SETI

18

Experimental Evaluation

Comparing Insert Performance, GSTD(1K, 4M), 10K Inserts

Scaling with Number of Segments, GSTD(1K, X), 0.01% Time-interval Query

Page 19: Indexing Large Trajectory Data Sets With SETI

19

Overview

• Motivation

• Problem definition and query types

• SETI

• Experimental Evaluation

• Strong and weak points

• Relation and stimulation to our project

• Conclusion

Page 20: Indexing Large Trajectory Data Sets With SETI

20

Strong and weak points

• Strong points– The structure of the paper is clear– Nearly complete experiment– Use sparse indices

• Weak points– No algorithm to contrast– Too briefly introduce some important technique:

• section 3.1 about indices clustered.• and section 3.5 about dynamic partition.

Page 21: Indexing Large Trajectory Data Sets With SETI

21

Overview

• Motivation

• Problem definition and query types

• SETI

• Experimental Evaluation

• Strong and weak points

• Relation and stimulation to our project

• Conclusion

Page 22: Indexing Large Trajectory Data Sets With SETI

22

Relation and stimulation to our project

• Same problem– Very similar data model and query types.

• Same experimental procedure– We also plan to compare different indexing

techniques.

• Different partitioning structure– We use static partitioning strategy.– We insert the segment which spans multiple spatial

cells into all cells it spans.

• Create Data Page and use sparse indices.

Page 23: Indexing Large Trajectory Data Sets With SETI

23

Overview

• Motivation

• Problem definition and query types

• SETI

• Experimental Evaluation

• Strong and weak points

• Relation and stimulation to our project

• Conclusion

Page 24: Indexing Large Trajectory Data Sets With SETI

24

Conclusion

• SETI is a new indexing method which build on an existing index(R-tree).

• SETI use sparse temporal indices + spatial partitions.

• SETI is good at range space based queries, but maybe not good at specific object based queries.