Statistical methods for real estate data prof. RNDr. Beáta Stehlíková, CSc. 2013

Statistical methods for real estate data

prof. RNDr. Beáta Stehlíková, CSc.

Beáta Stehliková, Bratislava 2

Informationis currently besides financial, energy,material resources

the main factor of progress.

How to obtain new knowledge?

We want to answer the question:How to obtain new information, new knowledge from data?

Talk only about one method of spatial statistics

Why spatial statistics ?Methods of spatial statistics are for spatial

Real estate data contain very often information about the geographic location – there are spatial data

Variable and data

A variable - a characteristic of population or sample that is of interest for us.

Data - the actual values of variables

Different kinds of data

Cross-sectional data are data on one or more variables collected at a single point in time

Time series data data are collected over a period of time on one or more variables

Panel data – the same cross-section over time

Obs Price (SEK) Living Area 1 600 000 80 2 750 000 95 3 675 000 75 4 825 000 84 . . .

200 925 000 96

Obs. Year Index GDP 1 1981 101 900 2 1982 105 1050 3 1983 110 1200 .

20 1999 250 8500

in real estate

Types of data (scale)

We have said that data - the actual values of variables

Types of data: Interval data are numerical observations Ordinal data are ordered categorical observations Nominal data are categorical observations

Types of data (scale)

Knowing the type of data (scale) is necessary to properly

select the technique to be used when analyzing data.

Descriptive statistics involves arranging, summarizing, and presenting a set of data in such a way that useful

information is produced.

Descriptive statistics

graphical techniques (histogram)numerical descriptive measures

Mean (average) Median (middle value) Mode (most frequently ) Variance Standard deviation

Descriptive statistics are not enough

Average (17,8) Standard deviation (4,7) Coefficient of variation

(26,4 %) n=25 1

6,1 10,1 14,1 18,1 22,1

9,8 13,8 17,8 21,8 25,8

It is necessary to know

the probability distribution

Consider two data sets A and B

Second example

Consider two large data sets A and B

The location information

It is not possible to identify differences between data sets without we take into account the location information

The location information

Variograms quantify changes in values in the space

there is no there is spatial autocorrelation

small distances

correspond to small changes in values

small distances

correspond to large

changes in values

Spatial autocorrelation

The degree to which near and more distant things are interrelated

Measures of spatial autocorrelation attempt to deal with similarities

in the location of spatial objects and their attributes

Spatial autocorrelation

Positive (objects similar in location are similar in attribute)

Negative (objects similar in location are very different)

Zero (attributes are independent of location)

Spatial autocorrelation - measures.

Several measures available: Moran’s coefficient I, Geary’s C coefficient, Getis-Ord coefficient G.

These measures may be •“global” - they apply to the study region • or “local” - autocorrelation may exist in some parts of the region but not in others.

Moran’s coefficient I

varies between –1.0 and + 1.0 0 indicates no spatial autocorrelation [1/(n-1)]

(indicate random pattern) When autocorrelation is high, the I coefficient is

close to 1 or -1 Negative values I indicate negative

autocorrelation Positive values I indicate positive autocorrelation

(indicate a tendency toward clustering)

Regression analysis

is a technique for using data to identify relationships among variables and use these relationships to make predictions.

Regression analyses that ignore spatial dependency can have

unstable parameter estimates and unreliable significance tests.

Solution: Spatial Autoregressive Models Lag model Spatial Error model

Spatial Models

SPATIAL LAG SPATIAL ERROROrdinary Least Squares

No influence from neighbors

Dependent variable influenced by

neighbors

Residuals influenced by neighbors

Y = β0 + Xβ Y = β0 + λ WY + Xβ + ε Y = β0 + Xβ + ρWε + ξ

Lag model controls spatial autocorrelation in the dependent variable

Error model controls spatial autocorrelation in the residuals, thus it controls autocorrelation in

the dependent and the independent variables

Software GeoDa

Compare different spatial models

Neither R2 nor Adjusted R2 can be used to compare different spatial regression models

We can used Akaike Information Criteria (the smaller the AIC value the better the model)

Example

dependent variable y – price of dwellingindependent variable x – living area

Classical regression analysis

Residuals

Moran´s I = 0.193022

Significance:P value= 0.03140<0.05

This indicate positive spatial autocorrelation

between residuals.

Spatial error model

Local Moran’s coefficients

Which values produce spatial autocorrelation ?

Spatial statistics

Methods of spatial statistics very use full for data with the location information

The art of looking for beauty,and science looking for true.

Spatial statistics will help us find the truewhen we use the right methods

Statistical methods for real estate data prof. RNDr. Beáta Stehlíková, CSc. 2013

Documents

Eva Stehlíková The Laterna Magika of Josef Svoboda and Alfréd … · 2019. 3. 30. · Yorick_2011_20110828.indd 173 16.9.2011 ... living and intelligent show’ (HAVEL 1999: 251),

Inborn chromosomal abnormalities 5th year RNDr Z.Polívková

Embryology Doc. MUDr. Ing. RNDr. Peter Celec, DrSc., MPH petercelec@gmail.com

Csóka Beáta - Címlapcsokabeata.hu/Kottak/Bartok/Bartok___Mikrokosmos_Vol_4.pdfCreated Date 7/23/2003 3:47:47 PM

doc. RNDr. Juraj Bujdák, DrSc. - uniba.skdoc. RNDr. Juraj Bujdák, DrSc. ABC Kapitoly vo vedeckých monografiách vydané v zahraničných vydavateľstvách ABC01 Rode, Bernd Michael

UP MS Department of Biophysics -Beáta Bugyi 1 1.pdfBiophysics I 2013-2014 12/2/2014 UP MS Department of Biophysics -Beáta Bugyi 3 Hild, Bugyi et al. Cytoskeleton2010 AKTIN AKTIN

V. Black-Scholes model: Derivation and solution - uniba.sk · V. Black-Scholes model: Derivation and solution Beáta Stehlíková Financial derivatives, winter term 2014/2015 Faculty

Chromosomal basis of heredity RNDr. Z.Polívková Lecture No135 – Course:Cell structure

STUDY PROGRAM 2017/2018 Subjects of the Basic …. Bugyi Beáta 28 Molecular basis of muscle function and contraction regulation Dr. Bugyi Beáta Practices 1 Introduction. Laboratory

Prof. RNDr. Milan Mišík, DrSc. kompletný zoznam publikáciíProf. RNDr. Milan Mišík, DrSc. kompletný zoznam publikácií. Monografické práce a kapitoly v monografiách (monographic

Enzymopathy – Inherited Metabolic Disorders RNDr. Hana Zoubková, PhD Energy -228

Phenotypes, genotypes Populations genetics RNDr Z.Polívková RNDr Z.Polívková Lecture No 428 - Lecture No 428 - course : Heredity

RNDr. Michal Bal a zia, Ph.D. - Masaryk University · RNDr. Michal Bal a zia, Ph.D. Last Update 27/Jun/2018 Personal Nationality: Slovak Birthday: 09/Aug/1988 Marital status: single

GENERAL MEDICINE - med.muni.cz · prof. RNDr. Petr Dubový, CSc. Department of Anatomy - Theoretical Departments - Faculty of Medicine Contact Person: prof. RNDr. Petr Dubový, CSc

PhD thesis Bartosova STAG · Department of Parasitology Ph.D. Thesis Phylogenetic analyses of myxosporeans based on the molecular data RNDr. Pavla Bartošová Supervisor: RNDr. Ivan

EXPERIENCES OF THE EVALUATOR RNDr. Zuzana BOUKALOVÁ CROSSCZECH, CCSS, GEO Group

Analysis of Human EEG Data Pavel Stránský Supervisor: Prof. RNDr. Petr Šeba, DrSc

Captivating Symmetry by RNDr Daniela Richtarikova Slovak Technical University (Slovakia)

HUNGARY - EURORDIS · PDF fileHUNGARY – EUROPLAN ... Katalin Brunner, patient representative (HUFERDIS) ... Németh, Károly Fogarassy, Péter Horváth, Katalin Brunner, Beáta Boncz,

Beáta Fekete Syntheses and transformations of alicyclic