Stepan Koltsov — Message passing: multithreaded...


DESCRIPTION

Stepan Koltsov, Yandex. The most widely used multithreaded synchronization primitives are the mutex and the condvar. These primitives perform poorly under contention (i.e. when several threads enter the same critical section): lock acquire and release operations become orders of magnitude slower and put a noticeable load on the CPU, system performance degrades unpredictably, and other problems appear. An alternative approach to multithreaded programming is message passing. Stepan will explain how mutexes work, why these problems arise, and how to implement message passing efficiently.

Citation preview

Concurrency without mutexes

What’s wrong with mutex?

• Hard to write safe code

• Mutexes are slow

• Hard to parallelize

Hard to write safe code

void first() {
  Guard<Mutex> guard(mutex1);
  ...
}

void second() {
  Guard<Mutex> guard(mutex2);
  ...
}

void third() {
  Guard<Mutex> guard(mutex1);
  ...
  second(); // possible deadlock
}

void fourth() {
  Guard<Mutex> guard(mutex2);
  ...
  first(); // possible deadlock
}

void foo() {
  // must be locked
}

void bar() { Guard guard; foo(); }
void baz() { Guard guard; foo(); }
void qux() { foo(); }
void quux() { Guard guard; qux(); }
void corge() { quux(); }
// grault does not lock
void grault() { qux(); }

Mutexes are expensive

• mutex lock/unlock takes about 1us under contention

• under high load there is almost always contention

• spinlocks are not worse

struct spinlock lock = SPINLOCK_INIT;

void do_smth() {
  spinlock_lock(&lock);
  …
  spinlock_unlock(&lock);
}

Spinlock API

struct spinlock {
  int locked;
};

#define SPINLOCK_INIT { 0 }

void spinlock_lock(struct spinlock* spinlock) {
  while (!atomic_compare_exchange(&spinlock->locked, 0, 1)) {
  }
}

void spinlock_unlock(struct spinlock* spinlock) {
  atomic_store(&spinlock->locked, 0, __ATOMIC_SEQ_CST);
}

Spinlock impl

Code examples

github.com/stepancheg/no-mutex-c
github.com/stepancheg/no-mutex

struct mutex lock = MUTEX_INIT;

void do_smth() {
  mutex_lock(&lock);
  …
  mutex_unlock(&lock);
}

Mutex API

struct mutex {
  int locked; // 1 if locked
  int count;  // number of threads requesting a lock
};

#define MUTEX_INIT { 0, 0 }

void mutex_lock(struct mutex* mutex) {
  atomic_add_fetch(&mutex->count, 1);
  while (!atomic_compare_exchange(&mutex->locked, 0, 1)) {
    futex(&mutex->locked, FUTEX_WAIT, 1);
  }
}

void mutex_unlock(struct mutex* mutex) {
  int left = atomic_add_fetch(&mutex->count, -1);
  atomic_store(&mutex->locked, 0);
  if (left > 0) {
    futex(&mutex->locked, FUTEX_WAKE, 1);
  }
}

Mutex impl

Numbers

lock cmpxchg                     8 ns
uncontended mutex lock/unlock   11 ns
futex_wake                     400 ns
contended mutex lock          ~500 ns

Hard to parallelize

• We want some app to use 5 cores. How many threads should we allocate?

There’s a solution!

Message passing / Actor model

class BlockingQueue<T> {
  void Enqueue(T elem) { … }
  // block if empty
  Vector<T> DequeueAll() { … }
}

BlockingQueue

class BlockingQueue<T> {
  Mutex mutex;
  CondVar condVar;
  Vector<T> elements;

  void Enqueue(T elem) {
    mutex.lock();
    elements.push(elem);
    condVar.signal();
    mutex.unlock();
  }
}

class BlockingQueue<T> {
  Mutex mutex;
  CondVar condVar;
  Vector<T> elements;

  Vector<T> DequeueAll() {
    mutex.lock();
    while (elements.empty()) {
      condVar.wait(mutex);
    }
    Vector<T> r = move(elements);
    mutex.unlock();
    return r;
  }
}

Simple message passing with dedicated thread

// non-blocking queue
// mutex+condvar
BlockingQueue<Request> queue;

void runProcessingThread() {
  for (;;) {
    Vector<Request> requests = queue.dequeueAll();
    // process requests
  }
}

void start(Request request) {
  queue.enqueue(request);
}

Actors

interface Runnable {
  void run();
}

interface ThreadPoolExecutor {
  void submit(Runnable);
}

Executor

abstract class Actor {
  Actor(Executor executor);

  // is not called in parallel
  protected abstract void act();

  // execute act()
  // at least once
  void schedule() { … }
}

Actor

class MyReqProcessor: Actor {
  MyReqProcessor(Executor exec) {
    super(exec);
  }

  NonBlockingQueue<Request> queue;

  override void act() {
    // is not called in parallel
    Vector<Request> reqs = queue.dequeueAll();
    // process reqs
  }

  // may be called from different threads
  void addWork(Request request) {
    queue.enqueue(request);
    schedule();
  }
}

MyReqProcessor

enum ETaskState {
  WAITING,
  RUNNING,
  RUNNING_GOT_TASKS,
};

class Actor: Runnable {
  Atomic<TaskState> taskState;

  void schedule() {
    if (AtomicSwap(RGT) == WAITING) {
      executor.submit(this);
    }
  }

  …
}

Actor.schedule

enum ETaskState {
  WAITING,
  RUNNING,
  RUNNING_GOT_TASKS,
};

class Actor: Runnable {
  Atomic<TaskState> taskState;

  override void run() {
    for (;;) {
      while (CAS(RGT -> RUNNING)) {
        // fetch tasks
        // act
      }
      if (CAS(RUNNING -> WAITING)) {
        return;
      }
    }
  }
}

Actor.run

Thanks
