Kubernetes to scale

Kubernetes to Scale

michele.orsi@lastminute.com @micheleorsi

GDG Cloud - London, 11 January 2017

Started with a monolith ...

https://www.flickr.com/photos/southtopia/5702790189

https://www.pexels.com/photo/gray-pebbles-with-green-grass-51168/

... broken into microservices

Micro-problems at scale

● alignment

● real pipelines

● infrastructure

● resilience

● monitoring

● constraints

An year-long endeavour

● build a new, modern infrastructure

● migrate the search (flight/hotel) product there

... without:

● impacting the business● throwing away our whole datacenter

How we did that: technology

● company framework

● docker

● kubernetes

How? Teams and peopleHow we did that: team/people

https://www.pexels.com/photo/blue-lego-toy-beside-orange-and-white-lego-toy-standing-during-daytime-105822/

APP3-PRODUCTION

Kubernetes: our architecture

APP2-PRODUCTIONAPP1-PRODUCTION

APP1-PREVIEW

APP1-DEVELOPMENT

APP1-QA

APP1-STRESSTEST

nonproductionproduction

APP1-PRODUCTION

deployment

replica-set

production

APP1-PRODUCTION

deployment

replica-set

secret configmap

production

APP1-PRODUCTION

deployment

replica-set

(ingress)path: app1-production.prd.lmn.intra

secret configmap

production

nginx-ingress-ctrl: 80

cluster

10.0.0.2

POD10.0.0.1

nginx-ingress-ctrl: 80

POD10.0.0.3POD

10.0.0.4

POD10.0.0.5

POD10.0.0.6

APP1-PRODUCTION

collectd

production

application fluentd

/liveness:

● when tomcat container is up● when “active/max” threads < threshold

/readiness:

● all the startup jobs have run● no termination request has been received

.. ongoing never-ending research ..

Self-healing: our choice for resilience

Kubernetes: what’s left outside?

● datastores

● distributed caches (early 2017)

● distributed locking

● pub-sub/queues

● logs and metrics storage

● zero downtime during rollout

● monitoring in place

● alerting

● centralized logging

● legacy infrastructure to the rescue in case of problem

When can you test with production traffic?

... failure ... at all different levels ..

https://www.flickr.com/photos/ghost_of_kuji/2763674926

Main problems

● configuration

● infrastructure

● tools

● manual mistakes

● (external) scalability

There’s light .. at the end

https://www.pexels.com/photo/grayscale-photography-of-person-at-the-end-of-tunnel-211816/

Pipeline: a huge step forward

microservice = factory.newDeployRequest().withArtifact(“com.lastminute.application1”,2)

lmn_deployCanaryStrategy(microservice,”qa”)

lmn_deployStableStrategy(microservice,”preview”)

lmn_deployCanaryStrategy(microservice,”production”)

pipeline

APP1-PRODUCTION

Monitoring: grafana/graphite/nagios

cluster

graphiteapplication collectd

Grafana

nagios

icons from http://www.flaticon.com

● lead and migration time

● resilience

● root cause analysis

● speed of deployment

● instant scaling

... benefits

● 36 bare-metal nodes (only for production cluster)● 5100 req/sec in the new cluster● 2M metrics/minute flows● 35 micro-services migrated in 5 months

○ 3 new micro-services migrated per week○ 10 minutes to create a new environment

● 11 min to roll-out a new version with 55 instances○ whole pipeline runs in 16 min

Give me the numbers!

Yes, we’re hiring!

THANKS

www.lastminutegroup.com

Kubernetes to scale

Technology

Kubernetes on EGO : Bringing enterprise resource management and scheduling to Kubernetes

KubeCon EU 2016: Monitoring Microservices: Docker, Kubernetes, and GKE Visibility at Scale

Kubernetes Autoscaling on Azure - LF Asia, LLC · Autoscalingin Kubernetes • Horizontal Pod autoscaler(HPA) – Scale number of Pods • Vertical Pod autoscaler(VPA) – Scale resources

Kubernetes Patterns Kubernetes Patterns

Microservices at scale with docker and kubernetes - AMS JUG 2017

Containers at Scale – Kubernetes and Docker - Red Hatpeople.redhat.com/mskinner/rhug/q1.2015/docker-and-kubernetes.pdf · Containers at Scale – Kubernetes and Docker ... registry=registry-host:5000

Efficiently exposing apps on Kubernetes at scale · Some useful best practices for these tools and processes How to use automation to scale the process for multiple apps. ... Needs

Deploy at scale with CoreOS Kubernetes and Apache Stratos

Introduction to Kubernetes

Kubernetes: The Future of Infrastructure€¦ · we wanted to scale our infrastructure to support software development needs, we had to buy more servers and physically scale it. This

APPLICATIONS AND CONTAINERS AT SCALE: OpenShift + Kubernetes + Docker

1. Introduction to Kubernetes 03 - images.linoxide.comimages.linoxide.com/ebook-kubernetes-essentials.pdf · 2 Networking Constraints 98 3 Inspecting and Debugging Kubernetes 98 4

Kubernetes laravel and kubernetes

From Code to Kubernetes

Infrastructure Design for Kubernetes · Kubernetes 1.9 Kubernetes 1.10 Kubernetes 1.11 Kubernetes 1.12 December 2017 March 2018 June 2018 September 2018 Kubernetes 1.13 December 2018

Secure Inference at Scale in Kubernetes* Container …...For stateful sessions, it is beneficial to route the sequential calls from a client to the same Kubernetes pod back end instance

Kubernetes Kubernetes · the 12-Factor App philosophy. Focus on the ability to scale horizontally, observability (metrics, logging, tracing) and resiliency. Docker Images #3 Docker

Cloud-Scale Kubernetes at eBay

OpenShift Container Platform 4.4 Pipelines · integrate with the existing Kubernetes tools, enabling you to scale on-demand. You can use OpenShift Pipelines to build images with Kubernetes

KUBERNETES AND OPENSTACK AT SCALE · upstream test suites: ... - PLACEMENT: "test" # Placement of the WLG pods based on node label ... KUBERNETES AND OPENSTACK AT SCALE #OPENSTACKSUMMIT