40
Visualising heterogeneous cinema data sets Big, Open Data and the Practice of GIScience RGS-IBG Annual Conference, London 29 August 2013 Colin Arrowsmith, School of Mathematical and Geospatial Science, RMIT University, Melbourne, Victoria, Australia Deb Verhoeven and Alwyn Davidson, School of Communication and Creative Arts, Deakin University, Melbourne, Victoria, Australia

London 2013 rgs_arrowsmith

Embed Size (px)

Citation preview

Page 1: London 2013 rgs_arrowsmith

Visualising heterogeneous cinema data sets

Big, Open Data and the Practice of GIScienceRGS-IBG Annual Conference, London 29 August 2013

Colin Arrowsmith, School of Mathematical and Geospatial Science, RMIT University, Melbourne, Victoria, Australia

Deb Verhoeven and Alwyn Davidson, School of Communication and Creative Arts, Deakin University, Melbourne, Victoria, Australia

Page 2: London 2013 rgs_arrowsmith

2

A big data project

“Only at the movies: Kinomatics”

School of Mathematical and Geospatial Sciences

Page 3: London 2013 rgs_arrowsmith

3

Objective

To investigate spatial patterns of film diffusion across the world.

–How do films circulate around the world?

–Does spatial clustering affect film screening?

–How does seasonality affect screening?

School of Mathematical and Geospatial Sciences

Page 4: London 2013 rgs_arrowsmith

Dimensions of “Big data”

• Variety

• Velocity

• Volume

IBM “Bringing big data to the Enterprise”

(http://www-01.ibm.com/software/au/data/bigdata/)

• Visualization

School of Mathematical and Geospatial Sciences 4

Page 5: London 2013 rgs_arrowsmith

Working with “Big data”

• Database downloaded from commercial film data collector

• 2 to 2.5 million showtime records per week

• 30000 movies downloaded after seven months

• 28000 cinema venues and 118000 screens

• 63.5 million records equating to 4.8 Gbytes of data

School of Mathematical and Geospatial Sciences 5

Page 6: London 2013 rgs_arrowsmith

Database schema

School of Mathematical and Geospatial Sciences 6

Page 7: London 2013 rgs_arrowsmith

Projects exploring approaches for visualising and analysing big film data

• Geographic methods– Post-war cinema venues in Australia (change-over-time)

– Global cartograms for cinema (point-in-time)

– Global patterns of movement

• Non-geographic (conceptual)– Multivariate visualisations (change-over-time)

– Film circulation (Markov-Chains)

School of Mathematical and Geospatial Sciences 7

Page 8: London 2013 rgs_arrowsmith

Geographic examples

• Post-war cinema venues in Australia (change-over-time)

• Global cartograms for cinema (point-in-time)

• Global patterns

School of Mathematical and Geospatial Sciences 8

Page 9: London 2013 rgs_arrowsmith

Static maps of post war cinema venues in Australia

• Basis for data was scanned “Film Weekly” summaries

• Base year of 1948 derived

• New and closed cinemas determined

• Significant post-processing

School of Mathematical and Geospatial Sciences 9

Page 10: London 2013 rgs_arrowsmith

Film Weekly scan

School of Mathematical and Geospatial Sciences 10

Page 11: London 2013 rgs_arrowsmith

Rural scale changes

1948 to 1953

1963 to 1968 1968 to 1971

1953 to 1958 1958 to 1963

School of Mathematical and Geospatial Sciences 11

Page 12: London 2013 rgs_arrowsmith

Rural scale changes

1948 to 1953

1963 to 1968 1968 to 1971

1953 to 1958 1958 to 1963

School of Mathematical and Geospatial Sciences 12

Page 13: London 2013 rgs_arrowsmith

Urban scale changes (Melbourne)

1948 to 1953

1963 to 1968 1968 to 1971

1953 to 1958 1958 to 1963

School of Mathematical and Geospatial Sciences 13

Page 14: London 2013 rgs_arrowsmith

Global cinema cartograms

• Cartogram is a map where a thematic variable is substituted for area (or distance)

• Population substituted for area

School of Mathematical and Geospatial Sciences 14

Page 15: London 2013 rgs_arrowsmith

CartogramsGlobal cinema numbers

15

Page 16: London 2013 rgs_arrowsmith

Global screen numbers

16

Page 17: London 2013 rgs_arrowsmith

Continent-wide patterns

School of Mathematical and Geospatial Sciences 17

Page 18: London 2013 rgs_arrowsmith

Global patterns

School of Mathematical and Geospatial Sciences 18

Page 19: London 2013 rgs_arrowsmith

Life of Pi

30 November 2012 7 December 2012

14 December 2012 21 December 2012 19

Page 20: London 2013 rgs_arrowsmith

Life of Pi

28 December 2012 4 January 2013

11 January 2013 17 January 2013 20

Page 21: London 2013 rgs_arrowsmith

Life of Pi (November 2012 to January 2013)

School of Mathematical and Geospatial Sciences 21

Page 22: London 2013 rgs_arrowsmith

Life of Pi (November 2012 to January 2013)

School of Mathematical and Geospatial Sciences 22

Page 23: London 2013 rgs_arrowsmith

Non-geographic examples

• Multivariate visualisations (change-over-time)

• Film circulation (Markov-Chains)

School of Mathematical and Geospatial Sciences 23

Page 24: London 2013 rgs_arrowsmith

24

Page 25: London 2013 rgs_arrowsmith

Visualisations

School of Mathematical and Geospatial Sciences 25

Page 26: London 2013 rgs_arrowsmith

Movement approaches: The Greek cinema circuit

• Objective

– To explore historical changes in the diasporic Greek cinema distribution of Finos and Anzervos films during the period 1956 to 1963

• Rationale

– To demonstrate the role of geographic analysis in understanding cinema circuit behaviour

School of Mathematical and Geospatial Sciences 26

Page 27: London 2013 rgs_arrowsmith

Data acquisition

• Archival newspaper and oral history research

• Government records

– censorship records

– theatre licence and company records

• Geo-location using street address or via GPS

School of Mathematical and Geospatial Sciences 27

Page 28: London 2013 rgs_arrowsmith

Anzervos

School of Mathematical and Geospatial Sciences 28

Page 29: London 2013 rgs_arrowsmith

Finos

School of Mathematical and Geospatial Sciences 29

Page 30: London 2013 rgs_arrowsmith

Anzervos (section)

School of Mathematical and Geospatial Sciences 30

Page 31: London 2013 rgs_arrowsmith

Finos (section)

School of Mathematical and Geospatial Sciences 31

Page 32: London 2013 rgs_arrowsmith

Key chains identified

No. of venues

Anzervos Finos

1 B C B A

2 BC CB BC AD

3 BCB CBC BCB BCA

4 BCBC MGPC BCBC BCBA

School of Mathematical and Geospatial Sciences 32

Page 33: London 2013 rgs_arrowsmith

Circos – circular visualisations

• Film sequence (Fort of Freedom):

– BCBBBBBCAABBBBBB by screening

or

– BCBCAB venue sequencing

School of Mathematical and Geospatial Sciences 33

Page 34: London 2013 rgs_arrowsmith

Change in sequence (Anzervos)

The Fort of FreedomAli Pasha and Mrs Frossini

School of Mathematical and Geospatial Sciences 34

Page 35: London 2013 rgs_arrowsmith

Change in sequence (Finos)

AsteroMusic, Povery and Pride

School of Mathematical and Geospatial Sciences 35

Page 36: London 2013 rgs_arrowsmith

Change of venue date

School of Mathematical and Geospatial Sciences 36

Page 37: London 2013 rgs_arrowsmith

Change of venue date

Ali Pasha and Mrs Frosini

SJA

AACC

CBBBBBBCB

0

0.5

1

1.5

2

2.5

3

3.5

0 10 20 30 40 50 60 70

Days

Mo

nth

s

The Fort of Freedom

BBBBBA

ACBBBBBCB

0

5

10

15

20

25

30

35

0 5 10 15 20 25 30 35 40

Days

Mo

nth

s

School of Mathematical and Geospatial Sciences 37

Page 38: London 2013 rgs_arrowsmith

Change of venue date

Music, Poverty and Pride

F

KKDDDDABBBBBB

BBBBB

IIAAG

CP

0

10

20

30

40

50

60

70

80

90

100

0 20 40 60 80 100 120

Days

Mo

nth

s

Astero

DKFJJJJJJ

BBBBBB

DOO

BBBBBAA

AABBCBBBBBBCBBB

0

5

10

15

20

25

30

35

0 50 100 150 200 250

Days

Mo

nth

s

School of Mathematical and Geospatial Sciences 38

Page 39: London 2013 rgs_arrowsmith

• The olives are where films finished: green= Sydney venue, purple = Melbourne venue

• Leaves are screenings: yellow is QLD, light green is NSW, darker green is VIC, dark brown is SA

• The distance is days between screenings and done to scale

Finos

OLIVE TREES

Anzervos

School of Mathematical and Geospatial Sciences 39

Page 40: London 2013 rgs_arrowsmith

Issues working with “big” complex cinema data

•Multiple sources of data

•Working at multiple scales

•Working with historic data

•Multiple definitions

•Need for visualising both geographic and conceptual relationships

School of Mathematical and Geospatial Sciences 40