21
Big Data makes me grumpy Why have we built Valo?

Big Data makes me grumpy, Justo Ruiz Ferrer, CTO @ Valo

Embed Size (px)

Citation preview

Page 1: Big Data makes me grumpy, Justo Ruiz Ferrer, CTO @ Valo

Big Data makes me grumpy

Why have we built Valo?

Page 2: Big Data makes me grumpy, Justo Ruiz Ferrer, CTO @ Valo

First and foremost…

We are a software company

We are not systems integrators…We are not a consultancy…We are not a bespoke development center…

Who are we?

Page 3: Big Data makes me grumpy, Justo Ruiz Ferrer, CTO @ Valo

First and foremost…

We are a software company

Who are we?

We are innovatorsWe are humble and hungryWe do have a point of view

Page 4: Big Data makes me grumpy, Justo Ruiz Ferrer, CTO @ Valo

We build the best software solutions we can

Simple, but not simplerUsability comes firstResilientContinuous improvement

What do we do?

Page 5: Big Data makes me grumpy, Justo Ruiz Ferrer, CTO @ Valo

Stack for Streaming Data…

SYSLOG

SNMP

ETW

Sources

.

.

Agent A

Agent B

Agent N

Capture

.

.

A Topic

B Topic

N Topic

Distribution

.

.

Topology

A

Topology

B

Topology

N

Real Time

.

.

Hive

Elastic

HBase

Storage

Analytical

Tools

UI

Web

Access

Access

Page 6: Big Data makes me grumpy, Justo Ruiz Ferrer, CTO @ Valo

Stack for Streaming Data…

Page 7: Big Data makes me grumpy, Justo Ruiz Ferrer, CTO @ Valo

Grumpy cat questions?

How do we…

… add security? … ensure best performance?

… add auditability? … work with data abstractions?

… monitor it? … add new algorithms?Σ

Page 8: Big Data makes me grumpy, Justo Ruiz Ferrer, CTO @ Valo

Maybe it is time to…

…Think again

Simple, but not simplerUsability firstResilientContinuous improvement

Page 9: Big Data makes me grumpy, Justo Ruiz Ferrer, CTO @ Valo

I would like…

Streaming system

Grumpy cat shopping list

No (micro) batchesStream semantics

Page 10: Big Data makes me grumpy, Justo Ruiz Ferrer, CTO @ Valo

I would like…

Streaming systemStrong data centric solution

Grumpy cat shopping list

Rich type systemAnalytical results are streamsMultiple data formatsData validationImmutable

Page 11: Big Data makes me grumpy, Justo Ruiz Ferrer, CTO @ Valo

I would like…

Streaming systemStrong data centric solutionOne cluster

Grumpy cat shopping list

DecentralizationSymmetryIncremental scalabilityHeterogeneous Partition & replicationAdministrator friendly

Page 12: Big Data makes me grumpy, Justo Ruiz Ferrer, CTO @ Valo

I would like…

Streaming systemStrong data centric solutionOne clusterOpen

Grumpy cat shopping list

APIs everywhereREST architecture

Page 13: Big Data makes me grumpy, Justo Ruiz Ferrer, CTO @ Valo

I would like…

Streaming systemStrong data centric solutionOne clusterOpenML friendly

Grumpy cat shopping list

Write algorithms only onceCompatible with tools like R, Matlab, Python

Page 14: Big Data makes me grumpy, Justo Ruiz Ferrer, CTO @ Valo

I would like…

Streaming systemStrong data centric solutionOne clusterOpenML friendlySecure & auditable

Grumpy cat shopping list

Perfect forward secrecyData encrypted on diskClean monitoring

Page 15: Big Data makes me grumpy, Justo Ruiz Ferrer, CTO @ Valo

And my senior team reaction was…

Page 16: Big Data makes me grumpy, Justo Ruiz Ferrer, CTO @ Valo

We did it!

Grumpy cat no more…

Page 17: Big Data makes me grumpy, Justo Ruiz Ferrer, CTO @ Valo

Before you attempt to replicate our craziness…

This is hard. Seriously hard.

Amazing team

Still… If you are crazy enough…

Word of advice

Page 18: Big Data makes me grumpy, Justo Ruiz Ferrer, CTO @ Valo

Valo …

… is a dynamo inspired cluster

… incorporates built-in repositories

… exposes a rich API and data set

… polyglot

… unifies historical and stream processing

So, what is Valo?

Page 19: Big Data makes me grumpy, Justo Ruiz Ferrer, CTO @ Valo

Shall we see it?

Quick demo…

Page 20: Big Data makes me grumpy, Justo Ruiz Ferrer, CTO @ Valo

Future

Is Valo finished?

No! Continuous improvement, remember?

Focus on ML algorithms

Add an J/ODBC driver

Python / R integration

Formally publish all the SDKs

Data acquisition

Page 21: Big Data makes me grumpy, Justo Ruiz Ferrer, CTO @ Valo

Thank youvalo.io