Car sharing through the data analysis lens Chiara Boldrini*, Raffaele Bruno, and Mohamed Haitam Laarabi IIT-CNR, Italy

Page 1: Car sharing through the data analysis lens

Car sharing through the data analysis lens

Chiara Boldrini*, Raffaele Bruno, and Mohamed Haitam Laarabi
IIT-CNR, Italy

Page 2: Car sharing through the data analysis lens

Car sharing

• Shared use of a fleet of cars by CS members
• Obstacles to coordination removed by technology
• CS explosion in the last 15 years
• 3 main car sharing modes
  • Two-way: trips start and end at the same CS station
  • One-way: trips may end at any CS station
  • Free floating: on-street parking anywhere within the geofence

Timeline:

• 1891: invention of the taximeter
• 1916: Joe Saunders decided to lend out his Ford Model T to local and visiting businessmen
• 1948: Sefage started its service in Zürich
• 1970: Witkar (technology-based) in Amsterdam
• 2000: Zipcar
• 2008: Car2go (free-floating)
• 2011 (explosion): Autolib, DriveNow
• 2013: Enjoy

Page 3: Car sharing through the data analysis lens

Motivation behind this work

• Open problems in CS research
  • Vehicle redistribution
  • Cleaning and maintenance
  • Infrastructure planning
• Car sharing is a weak signal in the city landscape
  • The fraction of people relying on car sharing for their daily trips is rapidly increasing, but it is still in the order of single-digit percentage points in the best cases.
• Car sharing has been mostly studied through surveys and direct interviews with its members.
• Car sharing is typically not accounted for in household travel diaries periodically collected by city administrations.
• Can data mining offer helpful insights?

Page 4: Car sharing through the data analysis lens

The dataset

• Availability over time of CS vehicles in 10 European cities for one of the major free-floating car sharing operators.
• Observation period:
  • May 17, 2015 to June 30, 2015 (for 9 cities)
  • March 11, 2016 to May 12, 2016.
• Data collected every 1 minute using the available public API, which yields responses in the form of JSON files.
• Data cleaning:
  • Technical problems on the booking website -> corrupted entries discarded
  • Faulty GPS systems -> coordinates that are manifestly invalid (e.g., cars available in different countries) have been discarded.
• Data preprocessing and analysis have been carried out in R.

Electric vehicles only
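The two cleaning rules above can be sketched as follows. This is only an illustration in Python (the slides state that the actual processing was done in R); the JSON layout, the field names `vehicles`, `lat`, `lon`, and the bounding box are all assumptions, since the operator's API format is not shown in the slides.

```python
import json

# Hypothetical bounding box of one city's operational area (assumption).
CITY_BBOX = {"lat_min": 45.35, "lat_max": 45.60, "lon_min": 9.00, "lon_max": 9.35}

def parse_snapshot(raw_text):
    """Return the vehicle records of one API response, or [] if it is corrupted."""
    try:
        payload = json.loads(raw_text)
    except json.JSONDecodeError:
        return []  # corrupted entry: discard the whole response
    vehicles = payload.get("vehicles") if isinstance(payload, dict) else None
    return vehicles if isinstance(vehicles, list) else []

def has_valid_position(record, bbox=CITY_BBOX):
    """True if the record carries coordinates inside the city's bounding box."""
    try:
        lat, lon = float(record["lat"]), float(record["lon"])
    except (KeyError, TypeError, ValueError):
        return False  # missing or non-numeric coordinates
    return (bbox["lat_min"] <= lat <= bbox["lat_max"]
            and bbox["lon_min"] <= lon <= bbox["lon_max"])

def clean_snapshot(raw_text, bbox=CITY_BBOX):
    """Keep only the records that survive both cleaning rules."""
    return [r for r in parse_snapshot(raw_text) if has_valid_position(r, bbox)]
```

A bounding box is just one simple way to catch "cars available in different countries"; a polygon test against the real geofence would be stricter.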

Page 5: Car sharing through the data analysis lens

Dataset limitations

• Movements are inferred from cars disappearing on the CS map
  • Movement from A to B = a car disappears from location A to later reappear at location B
• No explicit way for distinguishing between regular customer trips and maintenance trips
• No direct information about the trajectory followed by the shared vehicle
  • We have queried Google Maps asking for directions and expected travel time between the source and destination coordinates of each trip
  • Estimated the travelled distance using real-life average consumption extracted from https://www.spritmonitor.de/en/

<vehicle_id, GPS_coords, engine_type, fuel_level, interior_exterior_state>
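The inference rule "a car disappears from A and later reappears at B" can be sketched as below. This is a minimal Python illustration, not the authors' R code: the snapshot structure (a time-ordered list of `(minute, {vehicle_id: (lat, lon)})` pairs) and the function name `infer_trips` are assumptions made for the example.

```python
def infer_trips(snapshots, sampling_step=1):
    """Infer trips from availability snapshots.

    snapshots: list of (minute, {vehicle_id: (lat, lon)}) sorted by time,
    where each dict holds the vehicles shown as available at that minute.
    A trip is recorded when a vehicle is missing from at least one
    snapshot between two sightings.
    """
    last_seen = {}  # vehicle_id -> (minute, position) of the latest sighting
    trips = []
    for minute, available in snapshots:
        for vehicle_id, position in available.items():
            if vehicle_id in last_seen:
                prev_minute, prev_position = last_seen[vehicle_id]
                if minute - prev_minute > sampling_step:
                    # The vehicle was unavailable in between: treat it as a trip.
                    # Customer and maintenance trips cannot be told apart here.
                    trips.append({
                        "vehicle_id": vehicle_id,
                        "start": prev_minute,
                        "end": minute,
                        "origin": prev_position,
                        "destination": position,
                    })
            last_seen[vehicle_id] = (minute, position)
    return trips
```

Note that a vehicle reappearing at the same location still counts as a trip in this sketch, and a missed API response would be indistinguishable from a very short rental.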

Page 6: Car sharing through the data analysis lens

The mode share in the 10 cities

• 3 classes of cities: one in which motorised modes dominate, one in which public transport (and hence walking) is more important, and one in which people move prevalently by bike

Source: Eurostat's City Urban Audit database

Page 7: Car sharing through the data analysis lens

Vehicle utilization rate

• The number of daily trips per vehicle
• Index that is often used as a measure of car sharing success, as it captures short and frequent trips

Service was in fact shut down
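As a worked illustration of the index defined above, the following Python sketch computes the average number of daily trips per vehicle. The function name and the idea of dividing by the full fleet size (rather than by vehicles active on a given day) are assumptions; the slides do not specify the denominator.

```python
def utilization_rate(n_trips, fleet_size, n_days):
    """Average number of trips per vehicle per day over the observation period."""
    if fleet_size <= 0 or n_days <= 0:
        raise ValueError("fleet_size and n_days must be positive")
    return n_trips / (fleet_size * n_days)
```

For example, 6 trips made by a fleet of 2 vehicles over 2 days give a rate of 1.5 daily trips per vehicle (made-up numbers, not taken from the dataset).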

Page 8: Car sharing through the data analysis lens

Does modal split correlate with successful CS?

• Bike (Pearson r = −0.34)
• Public transport (r = 0.22)
• Walking and motorcycle (r = 0.06 and r = 0.0051)
• Cars (r = 0.09).
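For reference, the Pearson coefficient reported above can be computed as in this minimal Python sketch. Here `xs` would hold one modal share per city and `ys` the corresponding utilization rates; these variable names are assumptions, and no values from the study are reproduced.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equally long samples."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two samples of equal length >= 2")
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sample")
    return cov / math.sqrt(var_x * var_y)
```

With only 10 cities, coefficients of this magnitude should be read with caution, since a single city can shift r noticeably.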

Page 9: Car sharing through the data analysis lens

Do we really need relocation?

• We divide the operational area into cells with side length 500 m
• We observe how the % of empty cells varies over time

There are always a lot of empty zones in the cities!

The CS in City#9 had opened just a few weeks before our data collection, and its service hadn't yet stabilized.
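The cell-based measurement above can be sketched as follows. This Python example assumes a simple equirectangular projection around a reference corner of the operational area, which is adequate at city scale; the function names, the origin, and the way the set of valid cells is supplied are all assumptions, as the slides do not describe how the grid was built.

```python
import math

CELL_SIDE_M = 500.0
EARTH_RADIUS_M = 6371000.0

def cell_of(lat, lon, origin_lat, origin_lon, side_m=CELL_SIDE_M):
    """Map a GPS position to (column, row) indices of a square grid."""
    north_m = math.radians(lat - origin_lat) * EARTH_RADIUS_M
    east_m = (math.radians(lon - origin_lon) * EARTH_RADIUS_M
              * math.cos(math.radians(origin_lat)))
    return (math.floor(east_m / side_m), math.floor(north_m / side_m))

def empty_cell_fraction(positions, all_cells, origin_lat, origin_lon):
    """Fraction of cells in the operational area with no available vehicle.

    positions: iterable of (lat, lon) of the vehicles available in one snapshot.
    all_cells: set of (column, row) cells that make up the operational area.
    """
    if not all_cells:
        raise ValueError("the operational area must contain at least one cell")
    occupied = {cell_of(lat, lon, origin_lat, origin_lon) for lat, lon in positions}
    return 1.0 - len(occupied & all_cells) / len(all_cells)
```

Evaluating `empty_cell_fraction` on every one-minute snapshot yields the time series of the percentage of empty cells.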

Page 10: Car sharing through the data analysis lens

The relocation potential – part 1

Even in the best case, vehicles remain parked most of the time!

• # empty cells + # idle vehicles = strong concentration of vehicles in certain areas
• This is good news for the research on vehicle redistribution: the operator can indeed exploit a large number of vehicles that are not used most of the time…

Page 11: Car sharing through the data analysis lens

The relocation potential – part 2

• …provided it is possible to accurately predict where cars will be requested in the near future
• We measure CELL REGULARITY in terms of the number of pickup events observed within the cell during working days.
• In order to measure how much the number of pickups varies across the observation period we use the technique described in [1].
  • We divide each day into bins
  • For each cell N, we compute the # of trips starting at N for each bin (1, …, n) of day i
  • We compute the accumulated variance during the l days of the observation period

Squared correlation

[1] Zhong, Chen, et al. "Variability in regularity: Mining temporal mobility patterns in London, Singapore and Beijing using smart-card data." PloS one 11.2 (2016): e0149222.
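The formulas on this slide did not survive extraction, so the following Python sketch is only one plausible reading of the procedure, not the definition from [1]. It assumes that each day is summarised by a vector of pickup counts per bin, and that variability is accumulated as 1 minus the squared correlation over all pairs of days. The bin width of 60 minutes, both function names, and the pairwise form are assumptions.

```python
MINUTES_PER_DAY = 1440

def daily_pickup_vector(pickup_minutes, bin_minutes=60):
    """Count the pickups of one cell on one day in each time bin.

    pickup_minutes: pickup times expressed as minutes since midnight.
    """
    n_bins = MINUTES_PER_DAY // bin_minutes
    counts = [0] * n_bins
    for minute in pickup_minutes:
        counts[min(int(minute) // bin_minutes, n_bins - 1)] += 1
    return counts

def squared_correlation(x, y):
    """Squared Pearson correlation between two daily vectors."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        # Constant vectors: treat identical days as perfectly similar.
        return 1.0 if list(x) == list(y) else 0.0
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    return (sxy * sxy) / (sxx * syy)

def accumulated_variability(day_vectors):
    """ASSUMED form: sum of (1 - r^2) over all pairs of observed days."""
    total = 0.0
    for i in range(len(day_vectors)):
        for j in range(i + 1, len(day_vectors)):
            total += 1.0 - squared_correlation(day_vectors[i], day_vectors[j])
    return total
```

Under this reading, a cell whose pickup profile repeats identically every working day scores 0, and the score grows as days resemble each other less.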

Page 12: Car sharing through the data analysis lens

The vast majority of cells have an extremely predictable behaviour, with limited variability.

However, the number of outliers is significant, and it should be taken into account when designing supply models for car sharing services (e.g., unpredictable cells should not be taken into account in the redistribution process).

Page 13: Car sharing through the data analysis lens

How many different cell usages?

• We used the following techniques:
  • We discretize time into bins with a duration of 10 minutes
  • We compute the average occupancy (# available vehicles) in each bin across the observation period
  • We normalize by the average daily availability in each cell
  • Dynamic Time Warping: measures how “close” two time series are
    • Takes into account minor shifts in time that can often be seen in time series
  • PAM clustering: creates k groups of similar stations based on the DTW distance
  • Silhouette method: for selecting the most informative k
• The optimal number of clusters in all cities ranges from 2 to 4.
• The fourth cluster, when present, is a very special cluster, composed of just one cell (overlapping with the airport zone)
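The pipeline listed above (normalised occupancy profiles, DTW distance, medoid-based clustering, silhouette score) can be sketched in plain Python as follows. Two caveats: the clustering step is a simplified alternating k-medoids with a deterministic start, not the full PAM swap search named on the slide, and all function names are hypothetical.

```python
def normalise(profile):
    """Divide an occupancy profile by its daily average."""
    mean = sum(profile) / len(profile)
    return [v / mean for v in profile]

def dtw_distance(a, b):
    """Classic dynamic time warping distance with absolute-difference cost."""
    inf = float("inf")
    cost = [[inf] * (len(b) + 1) for _ in range(len(a) + 1)]
    cost[0][0] = 0.0
    for i in range(1, len(a) + 1):
        for j in range(1, len(b) + 1):
            step = abs(a[i - 1] - b[j - 1])
            cost[i][j] = step + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[len(a)][len(b)]

def k_medoids(dist, k, max_iter=100):
    """Simplified k-medoids on a distance matrix (first k items as initial medoids)."""
    n = len(dist)
    medoids = list(range(k))

    def assign(meds):
        return [min(range(k), key=lambda c: dist[i][meds[c]]) for i in range(n)]

    for _ in range(max_iter):
        labels = assign(medoids)
        updated = []
        for c in range(k):
            members = [i for i in range(n) if labels[i] == c]
            if members:
                # New medoid: the member closest to all others in its cluster.
                updated.append(min(members, key=lambda i: sum(dist[i][j] for j in members)))
            else:
                updated.append(medoids[c])
        if updated == medoids:
            break
        medoids = updated
    return assign(medoids), medoids

def mean_silhouette(dist, labels):
    """Average silhouette width; items in singleton clusters contribute 0."""
    n = len(labels)
    clusters = set(labels)
    total = 0.0
    for i in range(n):
        own = [j for j in range(n) if labels[j] == labels[i] and j != i]
        others = clusters - {labels[i]}
        if not own or not others:
            continue
        a = sum(dist[i][j] for j in own) / len(own)
        b = min(
            sum(dist[i][j] for j in range(n) if labels[j] == c) / labels.count(c)
            for c in others
        )
        if max(a, b) > 0:
            total += (b - a) / max(a, b)
    return total / n
```

To choose k, one would run `k_medoids` for each candidate value (for instance 2 to 4) and keep the one with the highest `mean_silhouette`. With 10-minute bins each profile has 144 points, so the quadratic DTW cost stays small per pair.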

Page 14: Car sharing through the data analysis lens

Bell shaped: above average availability during the day and below average availability at night (business/commercial areas)

Inverted bell: above average availability at night and below average availability during the day (residential areas)

Flat: no significant difference in usage is detected over the whole day (mixed usage)

Page 15: Car sharing through the data analysis lens

Identifying potential service areas

• Cleaning and maintenance is a critical operational aspect in CS
  • The car sharing workforce is dispatched to collect vehicles that are in need of either
  • Moving workers around is expensive, and more efficient solutions could be found based on the vehicle usage in the city
• POTENTIAL SERVICE AREA: a location that vehicles pass by with very high probability within a predefined time window
  • Workshops could be deployed in this area, and this would make cleaning and maintenance operations much more efficient

Page 16: Car sharing through the data analysis lens

Our approach

• We define a reference time window W, corresponding to the accepted time before taking out a vehicle for maintenance
• Then, for each cell, we count the number of distinct vehicles seen by the cell during W
• We assume that a threshold of 50% of the vehicles would be acceptable for the car sharing operator to justify the opening of a workshop in the area
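The counting step above can be sketched in Python as follows. The input format (one `(day, cell, vehicle_id)` tuple per sighting), the function name, and the use of the total fleet size as the denominator for the 50% threshold are assumptions made for illustration.

```python
def candidate_service_cells(sightings, fleet_size, window_start, window_days,
                            threshold=0.5):
    """Return {cell: share of the fleet seen} for cells reaching the threshold.

    sightings: iterable of (day, cell, vehicle_id), one per vehicle sighting.
    The window W covers days window_start .. window_start + window_days - 1.
    """
    if fleet_size <= 0:
        raise ValueError("fleet_size must be positive")
    window_end = window_start + window_days
    seen = {}  # cell -> set of distinct vehicles observed during W
    for day, cell, vehicle_id in sightings:
        if window_start <= day < window_end:
            seen.setdefault(cell, set()).add(vehicle_id)
    shares = {cell: len(vehicles) / fleet_size for cell, vehicles in seen.items()}
    return {cell: share for cell, share in shares.items() if share >= threshold}
```

Using sets ensures that a vehicle parked in the same cell several times during W is counted once, which matches the "distinct vehicles" wording.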

Page 17: Car sharing through the data analysis lens

Only 2 cities out of 10 satisfy the 50% requirement when W = 15 days

• In both cases, the service area would be located at the airport!

Only 5 cities out of 10 are able to satisfy this requirement when W = 30 days

Page 18: Car sharing through the data analysis lens

Conclusions

• Car sharing hasn't been much data-driven so far, but data analysis can offer important insights for the management of a car sharing system
• In this work we have provided some examples, highlighting their impact on CS operations:
  • We have shown the importance of vehicle utilization rate and how it correlates with modal shares in the cities
  • We have highlighted the huge potential for vehicle relocation due to the high number of empty cells and also of idle vehicles
  • We have shown that the demand is generally very predictable, and this also can be exploited for relocation
  • We have discussed how to smartly deploy cleaning and maintenance facilities based on vehicle flows in the CS network