15
Predicting Locations Using Map Similarity(PLUMS): A Framework for Spatial Data Mining Sanjay Chawla(Vignette Corporation) Shashi Shekhar, Weili Wu(CS, Univ. of Minnesota) Uygar Ozesmi(Ericyes University, Turkey) http://www.cs.umn.edu/research/

Predicting Locations Using Map Similarity(PLUMS): A Framework for Spatial Data Mining Sanjay Chawla(Vignette Corporation) Shashi Shekhar, Weili Wu(CS,

  • View
    217

  • Download
    2

Embed Size (px)

Citation preview

Page 1: Predicting Locations Using Map Similarity(PLUMS): A Framework for Spatial Data Mining Sanjay Chawla(Vignette Corporation) Shashi Shekhar, Weili Wu(CS,

Predicting Locations Using Map Similarity(PLUMS): A Framework for

Spatial Data Mining

Sanjay Chawla(Vignette Corporation)

Shashi Shekhar, Weili Wu(CS, Univ. of Minnesota)

Uygar Ozesmi(Ericyes University, Turkey)

http://www.cs.umn.edu/research/shashi-group

Page 2: Predicting Locations Using Map Similarity(PLUMS): A Framework for Spatial Data Mining Sanjay Chawla(Vignette Corporation) Shashi Shekhar, Weili Wu(CS,

Outline• Motivation• Application Domain• Distinguishing characteristics of spatial data

mining• Problem Definition• Spatial Statistics Approach• Our approach: PLUMS• Experiments, Results, Conclusion and Future

Work

Page 3: Predicting Locations Using Map Similarity(PLUMS): A Framework for Spatial Data Mining Sanjay Chawla(Vignette Corporation) Shashi Shekhar, Weili Wu(CS,

Motivation• Historical Examples of Spatial Data

Exploration– Asiatic Cholera, 1855– Theory of Gondwanaland– Effect of fluoride on Dental Hygiene

• A potential application in news– Tracking the West Nile Virus

Page 4: Predicting Locations Using Map Similarity(PLUMS): A Framework for Spatial Data Mining Sanjay Chawla(Vignette Corporation) Shashi Shekhar, Weili Wu(CS,

Application Domain

• Wetland Management: Predicting locations of bird(red-winged blackbird) nests in wetlands

• Why we choose this application ?– Strong spatial component– Domain Expertise– Classical Data Mining techniques(logistic

regression, neural nets) had already been applied

Page 5: Predicting Locations Using Map Similarity(PLUMS): A Framework for Spatial Data Mining Sanjay Chawla(Vignette Corporation) Shashi Shekhar, Weili Wu(CS,

Application Domain: Continued..

Nest Locations Distance to open water

Vegetation Durability Water Depth

Page 6: Predicting Locations Using Map Similarity(PLUMS): A Framework for Spatial Data Mining Sanjay Chawla(Vignette Corporation) Shashi Shekhar, Weili Wu(CS,

Unique characteristics of spatial data mining

Spatial Autocorrelation Property

Page 7: Predicting Locations Using Map Similarity(PLUMS): A Framework for Spatial Data Mining Sanjay Chawla(Vignette Corporation) Shashi Shekhar, Weili Wu(CS,

Unique characteristics…cont

K

kkk PnearestAAd

KPAADNP

1

))(.,(1

),(

Average Distance to Nearest Prediction(ADNP):

Page 8: Predicting Locations Using Map Similarity(PLUMS): A Framework for Spatial Data Mining Sanjay Chawla(Vignette Corporation) Shashi Shekhar, Weili Wu(CS,

Location Prediction:Problem Formulation

• Given: A spatial framework S.– Explanatory functions,

– Dependent function

– A family F F of learning model function mappings

• Find an element

• Objective: maximize (map_similarity = classification_accuracy + spatial accuracy)

• Constraints: spatial autocorrelation exists

kX RSfk

:

}1,0{: YY RSf

ykky RRRFf ....:ˆ

Page 9: Predicting Locations Using Map Similarity(PLUMS): A Framework for Spatial Data Mining Sanjay Chawla(Vignette Corporation) Shashi Shekhar, Weili Wu(CS,

Spatial Statistics Approach1.

2. Xy XWyy

2”

X

X

e

eyob

1)1(PrLogistic Regression:

Page 10: Predicting Locations Using Map Similarity(PLUMS): A Framework for Spatial Data Mining Sanjay Chawla(Vignette Corporation) Shashi Shekhar, Weili Wu(CS,

Spatial Stat: Solution Techniques

• Least Square Estimation: Biased and Inconsistent

• Maximum Likelihood: Involve computation of large determinant(from W)

• Bayesian: Monte Carlo Markov Chain(e.g. Gibbs Sampling)

Page 11: Predicting Locations Using Map Similarity(PLUMS): A Framework for Spatial Data Mining Sanjay Chawla(Vignette Corporation) Shashi Shekhar, Weili Wu(CS,

Our Approach

Page 12: Predicting Locations Using Map Similarity(PLUMS): A Framework for Spatial Data Mining Sanjay Chawla(Vignette Corporation) Shashi Shekhar, Weili Wu(CS,

Experiment Setup

Page 13: Predicting Locations Using Map Similarity(PLUMS): A Framework for Spatial Data Mining Sanjay Chawla(Vignette Corporation) Shashi Shekhar, Weili Wu(CS,

Result(1)

FNTP

TPTPR

TNFP

FPFPR

Page 14: Predicting Locations Using Map Similarity(PLUMS): A Framework for Spatial Data Mining Sanjay Chawla(Vignette Corporation) Shashi Shekhar, Weili Wu(CS,

Result(2)

Page 15: Predicting Locations Using Map Similarity(PLUMS): A Framework for Spatial Data Mining Sanjay Chawla(Vignette Corporation) Shashi Shekhar, Weili Wu(CS,

Conclusion and Future work

• PLUMS >> Classical Data Mining techniques

• PLUMS State-of-the-art Spatial Statistics approaches

• Better performance(two orders of magnitude)

• Try other configurations of the PLUMS framework and formalize!