NYC Kafka Meetup 2015 - When Bad Things Happen to Good Kafka Clusters

When Bad Things Happen to Good Kafka Clusters

True stories that actually happened to production Kafka clusters

As told by Gwen Shapira, System Architect
@gwenshap

Disclaimer
I am talking about other people’s systems. Not yours.
I am sure you had perfectly good reasons to configure your system the way you did.
This is not personal criticism.
Just some stories and a few lessons we learned the hard way.

POCs are super easy
It’s time to go to production

We keep our data in /tmp/logs

What can possibly go wrong?

Replication-factor of 3 is way too much
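
The two settings joked about above can be checked mechanically. The following is a minimal sketch, not a tool from the talk: it assumes the broker settings live in a `server.properties`-style text, reads the real broker keys `log.dirs`, `log.dir`, and `default.replication.factor`, and flags data directories under `/tmp` and a default replication factor below 3. The function names and the threshold of 3 are assumptions made for illustration.

```python
from pathlib import PurePosixPath

# Assumed threshold for this sketch; the slides only imply that 3 is not "too much".
MIN_REPLICATION_FACTOR = 3


def parse_properties(text):
    """Parse simple key=value lines, skipping blanks and # comments."""
    props = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        props[key.strip()] = value.strip()
    return props


def check_broker_config(text):
    """Return warnings for the two risky settings (hypothetical helper)."""
    props = parse_properties(text)
    warnings = []

    # log.dirs falls back to log.dir, whose default is /tmp/kafka-logs.
    log_dirs = props.get("log.dirs") or props.get("log.dir") or "/tmp/kafka-logs"
    for entry in log_dirs.split(","):
        path = entry.strip()
        if PurePosixPath(path).parts[:2] == ("/", "tmp"):
            warnings.append(f"log directory {path} is under /tmp")

    # default.replication.factor is 1 when the key is not set.
    rf = int(props.get("default.replication.factor", "1"))
    if rf < MIN_REPLICATION_FACTOR:
        warnings.append(
            f"default.replication.factor={rf} is below {MIN_REPLICATION_FACTOR}"
        )
    return warnings
```

Calling `check_broker_config` on an empty configuration reports both problems, because the broker defaults are exactly the risky values: a data directory under `/tmp` and a replication factor of 1.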

__consumer_offsets topic?

Never heard of it, so it’s probably OK to delete.

What’s wrong with running Kafka 0.7?

Remember that time when…
We accidentally lost all our data?

We added new partitions…
And immediately ran out of memory

We wanted to look up records by time
The smaller the segments, the more accurate the lookups

So we created 10k segments.

We need REALLY LARGE messages

We just serialize JSON and throw it into a topic.
It’s easy.
The consumers will figure something out.
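
The slide does not say what the fix is. One common reading is that producers should commit to an explicit, versioned schema instead of leaving consumers to guess. The sketch below illustrates that idea with plain JSON and the standard library only; the `SCHEMAS` table, the field names, and both function names are hypothetical and are not taken from the talk.

```python
import json

# Hypothetical schema table for this sketch: version -> required fields and types.
SCHEMAS = {1: {"user_id": int, "event": str}}


def encode_event(payload, version=1):
    """Validate payload against the declared schema, then serialize it."""
    schema = SCHEMAS[version]
    for field, field_type in schema.items():
        if field not in payload:
            raise ValueError(f"missing required field: {field}")
        if not isinstance(payload[field], field_type):
            raise TypeError(f"field {field} must be {field_type.__name__}")
    # The envelope carries the version so consumers never have to guess the shape.
    envelope = {"schema_version": version, "payload": payload}
    return json.dumps(envelope, sort_keys=True).encode("utf-8")


def decode_event(data):
    """Return (version, payload), rejecting versions the consumer does not know."""
    envelope = json.loads(data.decode("utf-8"))
    version = envelope["schema_version"]
    if version not in SCHEMAS:
        raise ValueError(f"unknown schema version: {version}")
    return version, envelope["payload"]
```

The bytes returned by `encode_event` are what a producer would send as the message value. Rejecting malformed records before they reach the topic keeps the failure on the producer side, where it is cheapest to fix.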

Log4J is a great way to reliably send data to Kafka

Keep your Kafka safe!
“When it absolutely, positively has to be there: Reliability guarantees in Apache Kafka”

Wednesday, 11:20am, Room 3D

Thank you

Visit Confluent in booth #929
Books, Kafka t-shirts & stickers, and more…

Gwen Shapira | gwen@confluent.io | @gwenshap
