15
…optimise your IT investments Warehousing for low latency analytics Philip Howard Research Director – Bloor Research

…optimise your IT investments Warehousing for low latency analytics Philip Howard Research Director – Bloor Research

Embed Size (px)

Citation preview

Page 1: …optimise your IT investments Warehousing for low latency analytics Philip Howard Research Director – Bloor Research

…optimise your IT investments

Warehousing for low latency analytics

Philip HowardResearch Director – Bloor Research

Page 2: …optimise your IT investments Warehousing for low latency analytics Philip Howard Research Director – Bloor Research

telling the Information Management storyConfidential © Bloor Research 2009 telling the right storyConfidential © Bloor Research 2010

Agenda

What are low latency analytics?

The inverse quantity/latency relationship problem

What types of application need them?

What is required in query and analytic terms?

What’s special about low latency analytics?

What database structures and features will be useful?

Page 3: …optimise your IT investments Warehousing for low latency analytics Philip Howard Research Director – Bloor Research

telling the Information Management storyConfidential © Bloor Research 2009 telling the right storyConfidential © Bloor Research 2010

What is low latency?

A spectrum of query requirements typically ranging up to a few

minutes in duration

Page 4: …optimise your IT investments Warehousing for low latency analytics Philip Howard Research Director – Bloor Research

telling the Information Management storyConfidential © Bloor Research 2009 telling the right storyConfidential © Bloor Research 2010

The quantity/latency issue

Typically:

Large volumes to be ingested in (close to) real-time

Often involve large quantities of historic data

Analytics are often complex

Analytics are often ad hoc

High performance requirement

photo by Amnemona

Page 5: …optimise your IT investments Warehousing for low latency analytics Philip Howard Research Director – Bloor Research

telling the Information Management storyConfidential © Bloor Research 2009 telling the right storyConfidential © Bloor Research 2010

Low latency applications

Social media, social networking, web comparison sites, recommendation engines ….

Primarily about understanding customers and influencers

Latency varies depending on requirements

Page 6: …optimise your IT investments Warehousing for low latency analytics Philip Howard Research Director – Bloor Research

telling the Information Management storyConfidential © Bloor Research 2009 telling the right storyConfidential © Bloor Research 2010

Low latency applications

On-line gaming: casinos and video gaming

Typically about up/cross-selling but also fraud

The latter tends to have lower latency requirements

Page 7: …optimise your IT investments Warehousing for low latency analytics Philip Howard Research Director – Bloor Research

telling the Information Management storyConfidential © Bloor Research 2009 telling the right storyConfidential © Bloor Research 2010

Low latency applications

Telecommunications

Various applications including traffic analysis, mobile advertising (especially location specific analytics),

re-pricing and fraud

Can be very low latency e.g. mobile advertising

Page 8: …optimise your IT investments Warehousing for low latency analytics Philip Howard Research Director – Bloor Research

telling the Information Management storyConfidential © Bloor Research 2009 telling the right storyConfidential © Bloor Research 2010

Low latency applications

Real-time log and event management

Particularly important (and very low latency) for monitoring and responding to security threats such as cyber attacks

Page 9: …optimise your IT investments Warehousing for low latency analytics Philip Howard Research Director – Bloor Research

telling the Information Management storyConfidential © Bloor Research 2009 telling the right storyConfidential © Bloor Research 2010

Low latency applications

Real-time web analytics

Important to support on-line marketing campaigns and dropped shopping cart re-marketing

Typically a few minutes latency is fine

Page 10: …optimise your IT investments Warehousing for low latency analytics Philip Howard Research Director – Bloor Research

telling the Information Management storyConfidential © Bloor Research 2009 telling the right storyConfidential © Bloor Research 2010

Low latency applications

Others include:Fraud prevention

Capital markets

Network monitoring

Sensor-based applications

Page 11: …optimise your IT investments Warehousing for low latency analytics Philip Howard Research Director – Bloor Research

telling the Information Management storyConfidential © Bloor Research 2009 telling the right storyConfidential © Bloor Research 2010

Query and analytic requirements

Often require complex analytics with:Large table scansMulti-way joinsCorrelated sub-queries…

Often cannot be predicted in advance

Even in monitoring environments may require scoring against a model

Page 12: …optimise your IT investments Warehousing for low latency analytics Philip Howard Research Director – Bloor Research

telling the Information Management storyConfidential © Bloor Research 2009 telling the right storyConfidential © Bloor Research 2010

What’s special about low latency analytics?

Not just about (real-time) load speeds and ingestion rates

Not just about query performance

Also about what’s in-between

Having data available in memory before it is stored on disk

Standardising the data – may be multiple formats

Writing to disk

Page 13: …optimise your IT investments Warehousing for low latency analytics Philip Howard Research Director – Bloor Research

telling the Information Management storyConfidential © Bloor Research 2009 telling the right storyConfidential © Bloor Research 2010

Database features

High performing and scalable loading

Trickle feed, micro-batch or CDC

Agility (data sources)

No indexes, summary tables etc.

Avoid whole table scans

Performance speed-up

In-memory capabilities

High availability

Page 14: …optimise your IT investments Warehousing for low latency analytics Philip Howard Research Director – Bloor Research

telling the Information Management storyConfidential © Bloor Research 2009 telling the right storyConfidential © Bloor Research 2010

Conclusion

Analytic warehouses and marts for low latency applications have all the same requirements as for any other analytic environment

But they also require not just high ingestion rates and fast query performance, but high performance in the intermediate step(s) between ingestion and reading data off disk

Page 15: …optimise your IT investments Warehousing for low latency analytics Philip Howard Research Director – Bloor Research

telling the Information Management storyConfidential © Bloor Research 2009 telling the right storyConfidential © Bloor Research 2010