27
Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1 , Nikos Pelekis 1 , Yannis Theodoridis 1 , Cyril Ray 2 , Vangelis Karkaletsis 3 , Sergios Petridis 3 , Anastasia Miliou 4 1 University of Piraeus 2 Naval Academy, France 3 NCSR “Demokritos” 4 Archipelago – Inst. of Marine Conservation 1

Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1, Nikos Pelekis 1, Yannis Theodoridis 1, Cyril Ray 2, Vangelis Karkaletsis

Embed Size (px)

Citation preview

Page 1: Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1, Nikos Pelekis 1, Yannis Theodoridis 1, Cyril Ray 2, Vangelis Karkaletsis

Efficient AIS Data Processing for Environmentally Safe Shipping

Marios Vodas1, Nikos Pelekis1, Yannis Theodoridis1, Cyril Ray2, Vangelis Karkaletsis3, Sergios Petridis3,

Anastasia Miliou4

1 University of Piraeus2 Naval Academy, France

3 NCSR “Demokritos”4 Archipelago – Inst. of Marine Conservation

1

Page 2: Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1, Nikos Pelekis 1, Yannis Theodoridis 1, Cyril Ray 2, Vangelis Karkaletsis

Outline

1. Part I: Marine Transportation

2. Part II: Automatic Identification System (AIS)

3. Part III: Objectives

4. Part IV: Methodology

5. Part V: Conclusion

2

Page 3: Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1, Nikos Pelekis 1, Yannis Theodoridis 1, Cyril Ray 2, Vangelis Karkaletsis

I. MARITIME TRANSPORTATION

3

Page 4: Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1, Nikos Pelekis 1, Yannis Theodoridis 1, Cyril Ray 2, Vangelis Karkaletsis

Safety (and Environmental) Issues

Ships, control centers and marine officers have to face many security and safety problems due to: Staff reduction, cognitive overload, human errors Traffic increase (ports, maritime routes), dangerous contents Terrorism, pirates Technical faults (bad design, equipment breakdowns) Bad weather Etc.

4MarineTraffic.comHELCOME AIS IRENav (NATO)

Page 5: Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1, Nikos Pelekis 1, Yannis Theodoridis 1, Cyril Ray 2, Vangelis Karkaletsis

The Most Prominent Cause of Accidents

About 75-96% of marine casualties are caused, at least in part, by some form of human error * : 88% of tanker accidents 79% of towing vessel groundings 96% of collisions 75% of fires and explosions

Solution to such issues requires different levels of responses taking into account : People (activities) Technology Environment Organisational factors

5

*Rothblum A.M. (2006) “Human Error and Marine Safety”, U.S. Coast Guard Research & Development

Center

Page 6: Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1, Nikos Pelekis 1, Yannis Theodoridis 1, Cyril Ray 2, Vangelis Karkaletsis

Ways to Minimize Accidents

Level of education and practice for mariners

Work safety regulations (behaviour guidelines, normalised onboard equipments)

Navigation and decision support systems providing real-time information, predictions, alerts...

Integrate and use properly multiple and heterogeneous positioning systems : AIS, ARPA, Long Range Identification System (LRIT), Global Maritime Distress and Safety System (GMDSS), synthetic aperture radar, airborne radar, satellite based sensors

Generalisation of vessel traffic monitoring, port control, search and rescue systems, automatic communications

6

Page 7: Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1, Nikos Pelekis 1, Yannis Theodoridis 1, Cyril Ray 2, Vangelis Karkaletsis

Traffic Monitoring

7

Air-based supportHuman and semi-automatic

monitoringOn-demand and on a regular basis

Remote Sensing supportSemi-automatic monitoring

Every 2 to 6 hours

Sensor-based supportAlmost automatic analysis

and monitoringReal-time

Page 8: Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1, Nikos Pelekis 1, Yannis Theodoridis 1, Cyril Ray 2, Vangelis Karkaletsis

II. AUTOMATIC IDENTIFICATION SYSTEM (AIS)

8

Page 9: Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1, Nikos Pelekis 1, Yannis Theodoridis 1, Cyril Ray 2, Vangelis Karkaletsis

AIS Device

The Automatic Identification System identifies and locates vessels at distance It includes an antenna, a transponder, a GPS receiver and additional

sensors (e.g., loch and gyrocompass) It is a broadcast system based on VHF communications It is able to operate in autonomous and continuous mode

Ships fitted with AIS send navigation data to surrounding receivers (range is about 50 km)

Ships or maritime control centres on shore fitted with AIS receives navigation data sent by surrounding ships

9

→ AIS is mandatory (IMO) for big ships and passengers’ boats

Page 10: Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1, Nikos Pelekis 1, Yannis Theodoridis 1, Cyril Ray 2, Vangelis Karkaletsis

AIS Transmission Rate and Accuracy

AIS accuracy is defined as the largest distance the ship can cover between two updates The AIS broadcasts information with different rates of updates

depending on the ship’s current speed and manoeuvre The IMO assumes that accuracy of embedded GPS is 10m

10

Vessel behaviourTime

between updates

Accuracy (m)

Anchored 3 min = 10 metres

Speed between 0-14 knots 12 s Between 10 and 95 metres

Speed between 0-14 knots and changing course

4 s Between 10 and 40 metres

Speed between 14-23 knots 6 s Between 55 and 80 metres

Speed between 14-23 knots and changing course

2 s Between 25 and 35 metres

Speed over 23 knots 3 s > 45 metres

Speed over 23 knots and changing course

2 s > 35 metres

General update rules have been compared to reality: it appears that update rates are lower

Page 11: Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1, Nikos Pelekis 1, Yannis Theodoridis 1, Cyril Ray 2, Vangelis Karkaletsis

AIS Data

The AIS provide location-based information on 2D routes, this defining point-based 3D trajectories

Transmitted data include ship’s position and textual meta-information Static: ID number (MMSI), IMO code, ship name and type,

dimensions Dynamic: Position (Long, Lat), speed, heading, course over ground

(COG), rate of turn (ROT) Route-based: Destination, danger, estimated time of arrival (ETA)

and draught

11

That is, an ordered series of locations (X,Y,T) of a given mobile object O with T indicating the timestamp of the location (X,Y)

→ Time does not exist in AIS frames : to be add by receivers

!AIVDM,1,1,,A,1Bwj:v0P1=1f75REQg>rPwv:0000,0*3B

Page 12: Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1, Nikos Pelekis 1, Yannis Theodoridis 1, Cyril Ray 2, Vangelis Karkaletsis

III. OBJECTIVES

12

Page 13: Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1, Nikos Pelekis 1, Yannis Theodoridis 1, Cyril Ray 2, Vangelis Karkaletsis

Big AIS Data Processing for Environmentally Safe Shipping

Objectives, based on Archipelagos Institute of Marine Conservation requests, was to Investigate factors which contribute most to the risk of a shipping

accident Identify dangerous areas

How : traffic database processing in order to address some requirements / queries set by Archipelagos towards semi-quantitative risk analysis of shipping traffic

13

→ Data coming from AIS

→ Application to the Aegean Sea

Page 14: Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1, Nikos Pelekis 1, Yannis Theodoridis 1, Cyril Ray 2, Vangelis Karkaletsis

Typical Questions From Domain Experts

Calculate average and minimum distances from shore or between two ships

Calculate the maximum number of ships in the vicinity of another ship

Find whether (and how many times) a ship goes through specified areas (e.g. narrow passages, biodiversity boxes)

Calculate the number of sharp changes in ship’s direction

Find typical routes vs. outliers etc. etc.

14

Page 15: Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1, Nikos Pelekis 1, Yannis Theodoridis 1, Cyril Ray 2, Vangelis Karkaletsis

Mediterranean Sea

European Maritime Safety Agency (EMSA) centralizes data from EU states and provides them through a Web service

We worked on a dataset on Mediterranean sea provided By IMIS Hellas (a Greek IT company related to IMIS Global, collecting AIS data,

mariweb.gr)

15

→ Data Volume is 100 million positions per month, that is about 2300 positions per minutes

• Focus on Aegean sea : 3 days, 3 million position records (933 distinct ships)

• Full dataset is more than 2000 SQL tables for a total of 2 TB covering 2,5 years of vessel activity

Two datasets are available at Chorochronos.org interface (IMIS 3 days and AIS Brest)

Page 16: Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1, Nikos Pelekis 1, Yannis Theodoridis 1, Cyril Ray 2, Vangelis Karkaletsis

Vessel Statistics

16

Country Number of ships Flag of ConvenienceGreece 263 No

Panama (Republic of) 112 Yes

Turkey 96 No

Malta 76 Yes

Liberia (Republic of) 32 Yes

Vincent and the Grenadines

29 Yes

Page 17: Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1, Nikos Pelekis 1, Yannis Theodoridis 1, Cyril Ray 2, Vangelis Karkaletsis

IV. METHODOLOGY

17

Page 18: Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1, Nikos Pelekis 1, Yannis Theodoridis 1, Cyril Ray 2, Vangelis Karkaletsis

Populating a Database

Relational database (postgres and postgis) Data model based on AIS messages :

positions, ships and trips Parsing, Integration, error checking filtering Reconstructing trajectories from raw data

and feeding a trajectory DB Apply “simple” queries to answer experts

needs

18

“What is the (sub)trajectory of a ship during its presence in an area” ?

Page 19: Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1, Nikos Pelekis 1, Yannis Theodoridis 1, Cyril Ray 2, Vangelis Karkaletsis

MOD Engine and Rule-Based Analysis

An integrated approach for maritime situation awareness based on an inference engine (drools) The expert defines his rules according its needs

and objectives The engine executes rules using the AIS database

19

Hermes is a MOD engine providing extensible DBMS support for trajectory data Defines trajectory data type

SQL extensions at the logical level Efficient indexing techniques at the physical level

Includes trajectory clustering support

Mixed top-down / bottom-up approach involving an expert monitoring real-time

traffic on a touch table

http://infolab.cs.unipi.gr/hermes

Page 20: Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1, Nikos Pelekis 1, Yannis Theodoridis 1, Cyril Ray 2, Vangelis Karkaletsis

Methodology Steps

20

CleaningFilter:

Wrong CRC Duplicates

DecodingAIS type:

1/2/3 Position Report 5 Static and Voyage Related Data

CleaningFilter:

Invalid MMSI GPS Error ͙R

Hermes Loader

Degrees to Meters Trajectory Update Outputs Trajectories

Querying

Timeslice Range

Temporal only Spatial only Spatio-Temporal

Nearest Neighbor (NN) wrt. a reference static object

(point / segment / box) wrt. a reference trajectory

Advanced Querying

Pair-wise similarity queries OD-Matrix

origin/destination are spatial vs. spatio-temporal boxes

Trajectory Clustering

Page 21: Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1, Nikos Pelekis 1, Yannis Theodoridis 1, Cyril Ray 2, Vangelis Karkaletsis

Take the Maritime Environment Into Account

The maritime domain is peculiar as there is no underlying network but some maritime rules define predefined paths and anchorage areas (polylines and polygons) that might constrain a given trajectory

21

We added official vector chart and expert-defined areas of interest in the database

Coastlines

Starting, ending, passing, restricted areas, waiting zones

Regulations and dangers (rocs, buoys, seabed)

S-57 ENC (Electronic Nautical Chart)

Page 22: Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1, Nikos Pelekis 1, Yannis Theodoridis 1, Cyril Ray 2, Vangelis Karkaletsis

Exploring the Data

Calculating trajectory aggregations and feeding a trajectory data warehouse

Performing OLAP analysis over aggregations (eg. O/D analysis) Running KDD techniques : frequent pattern analysis,

clustering, outlier detection, etc.

22

Cloud of locationsAssociation of points

coming from the same source-destination set

Definition of a route and qualifying of positions at

each time

Qualifying of a new trajectory compared to the identified route

Page 23: Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1, Nikos Pelekis 1, Yannis Theodoridis 1, Cyril Ray 2, Vangelis Karkaletsis

Visualizing Trajectories and Patterns

23

← space-time cube: ship is late

space-time cube: trajectory too far

on the right →

speed behaviour

frequent patterns

→ Web-based visualisation using Google Maps / Earth applications, Openlayers (OSM)

Page 24: Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1, Nikos Pelekis 1, Yannis Theodoridis 1, Cyril Ray 2, Vangelis Karkaletsis

V. CONCLUSION

24

Page 25: Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1, Nikos Pelekis 1, Yannis Theodoridis 1, Cyril Ray 2, Vangelis Karkaletsis

Some Open Questions

Q1. What kind of storage is appropriate for BIG volumes of vessel traffic data?

Serial vs. parallel/distributed processing (e.g. Hadoop) (batch vs. streaming) MOD engines? What about indexing BIG mobility data?

Q2. What kind of analysis on vessel traffic data makes sense? Analysis on current (location, speed, heading, …) vs. historical

information (trajectories) Clusters (+ outliers), frequent patterns, next location prediction,

etc. Exploit on previous knowledge to improve real-time analysis

Q3. What kind of visualization is appropriate for vessel traffic data / patterns

Current location vs. trajectory-based visual analytics

25

Trajectory clustering

Frequent pattern mining

Page 26: Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1, Nikos Pelekis 1, Yannis Theodoridis 1, Cyril Ray 2, Vangelis Karkaletsis

Research Challenges on Data – Just a Few Examples

Trajectory compression / simplification: how to compress / simplify trajectories keeping quality as high as possible?

Semantic trajectory reconstruction: how to extract semantics from raw (GPS-based) trajectory data?

Trajectory sampling: how to find a representative sample among a trajectory dataset?

Generating trajectories by example: how to build large synthetic datasets that simulate the ‘behavior’ of a small real one?

Etc.

26

Page 27: Efficient AIS Data Processing for Environmentally Safe Shipping Marios Vodas 1, Nikos Pelekis 1, Yannis Theodoridis 1, Cyril Ray 2, Vangelis Karkaletsis

Questions

27