Upload
j-on-the-beach
View
384
Download
0
Embed Size (px)
Citation preview
Big Data makes me grumpy
Why have we built Valo?
First and foremost…
We are a software company
We are not systems integrators…We are not a consultancy…We are not a bespoke development center…
Who are we?
First and foremost…
We are a software company
Who are we?
We are innovatorsWe are humble and hungryWe do have a point of view
We build the best software solutions we can
Simple, but not simplerUsability comes firstResilientContinuous improvement
What do we do?
Stack for Streaming Data…
SYSLOG
SNMP
ETW
Sources
.
.
Agent A
Agent B
Agent N
Capture
.
.
A Topic
B Topic
N Topic
Distribution
.
.
Topology
A
Topology
B
Topology
N
Real Time
.
.
Hive
Elastic
HBase
Storage
Analytical
Tools
UI
Web
Access
Access
Stack for Streaming Data…
Grumpy cat questions?
How do we…
… add security? … ensure best performance?
… add auditability? … work with data abstractions?
… monitor it? … add new algorithms?Σ
Maybe it is time to…
…Think again
Simple, but not simplerUsability firstResilientContinuous improvement
I would like…
Streaming system
Grumpy cat shopping list
No (micro) batchesStream semantics
I would like…
Streaming systemStrong data centric solution
Grumpy cat shopping list
Rich type systemAnalytical results are streamsMultiple data formatsData validationImmutable
I would like…
Streaming systemStrong data centric solutionOne cluster
Grumpy cat shopping list
DecentralizationSymmetryIncremental scalabilityHeterogeneous Partition & replicationAdministrator friendly
I would like…
Streaming systemStrong data centric solutionOne clusterOpen
Grumpy cat shopping list
APIs everywhereREST architecture
I would like…
Streaming systemStrong data centric solutionOne clusterOpenML friendly
Grumpy cat shopping list
Write algorithms only onceCompatible with tools like R, Matlab, Python
I would like…
Streaming systemStrong data centric solutionOne clusterOpenML friendlySecure & auditable
Grumpy cat shopping list
Perfect forward secrecyData encrypted on diskClean monitoring
And my senior team reaction was…
We did it!
Grumpy cat no more…
Before you attempt to replicate our craziness…
This is hard. Seriously hard.
Amazing team
Still… If you are crazy enough…
Word of advice
Valo …
… is a dynamo inspired cluster
… incorporates built-in repositories
… exposes a rich API and data set
… polyglot
… unifies historical and stream processing
So, what is Valo?
Shall we see it?
Quick demo…
Future
Is Valo finished?
No! Continuous improvement, remember?
Focus on ML algorithms
Add an J/ODBC driver
Python / R integration
Formally publish all the SDKs
Data acquisition
Thank youvalo.io