Sane Sharding with Akka Cluster

Sane Sharding with Akka Cluster

Michał Płachta

@miciek

Live-coding & performance analysis

What’s inside?

● Creating a web service using actor model● ...analysing its performance● ...making it scalable

Akka Tutorial

● actor ~= thread● actorRef.tell● actorRef.ask● actors create children● actors have mailbox ActorRef

Sender 1 Sender 2

ask tell

enqueue

MailboxActor

dequeue

Scala Tutorial

● case class● pattern matching

case class Junction(id: Int)

public class Junction { private final int id;

public Junction(int id) { this.id = id; }

public int getId() { return id; }

// hashCode // equals // copy}

msg match { case Junction(id) => // this will execute

// when msg is Junctioncase SomeOtherType =>

}

First example: Sorter

scan <containerId> -> HTTP -> push right or not

See also: http://i.imgur.com/mctb4HC.gifv

http://i.imgur.com/mctb4HC.gifv

Sorter Web Service

http://localhost:8080/junctions/<junctionId>/decisionForContainer/<containerId>

returns JSON

{ “direction”: left | right | straight | ... }

Assumptions:

● 5-10 ms to make a decision● business logic already defined - focus on performance

http://localhost:8080/junctions/2/decisionForContainer/1






Let’s code it!

Step 1: Just REST...

RestInterface

HTTP Requests HTTP Responses

● One Actor = One Thread● Blocking inside receive method● Low throughput...

Throughput testing

/junctions/1/decisionForContainer/1 /junctions/2/decisionForContainer/4/junctions/3/decisionForContainer/5/junctions/4/decisionForContainer/2/junctions/5/decisionForContainer/7

2000 requests2000 requests2000 requests2000 requests2000 requests

in parallel

cat URLs.txt | parallel -j 5 'ab -ql -n 2000 -c 1 -k {}'

GNU Parallel ApacheBench

Let’s test it!

Step 1: Just REST...

RestInterface


± % cat URLs.txt | parallel -j 5 'ab -ql -n 2000 -c 1 -k {}' | grep 'Requests per second'

Requests per second: 34.78 [#/sec] (mean)





Let’s improve performance!

Step 1.5: Logic in another actor

RestInterface


SortingDecider

Step 2: One actor per junction

RestInterface


DecidersGuardian

SortingDeciderSortingDecider

SortingDecider

<junctionId>=1 ... <junctionId>=5

Step 2: One actor per junction







Now what?

● non-blocking● concurrent● scaling up works● scaling out?

RestInterface


DecidersGuardian


SortingDecider


Manual scaling out

RestInterface


DecidersGuardian


SortingDecider


RestInterface


DecidersGuardian


SortingDecider


Enter Sharding

RestInterface


ShardRegion


SortingDecider

<junctionId>=h(m) ... <junctionId>=h(m)

RestInterface


ShardRegion


SortingDecider

<junctionId>=h(m) ... <junctionId>=h(m)

...

Let’s shard it!

Step 3: Sharded web service







Sharding

● automatic distribution● no need to know who is where● no need to know how many nodes are there● rebalancing● migration

Thank you!Any questions?

Michał Płachta

@miciek

Technology

Sane Sharding with Akka Cluster