38

No Time-Outs: How to Empower Round-the-Clock Analytics

Embed Size (px)

DESCRIPTION

The Briefing Room with Rick Sherman and Actian Slides from the Live Webcast on Aug. 28, 2012 The appetite for high-powered analytics is greater than ever these days, with increasing numbers of business users clamoring for insights. At the same time, source systems are proliferating, and the nature of questions being asked is getting more complex. Indeed, the entire landscape of analytics is changing in fundamental ways. How can your organization stay ahead of the curve? Register for this episode of The Briefing Room to learn from veteran Analyst Rick Sherman how a variety of technologies can change the manner in which analytics are done. He'll be briefed by Fred Gallagher of Actian, who will explain how his company's Vectorwise technology leverages vector processing to expedite even the most complex queries when compared to traditional columnar or relational databases. For more information visit: http://www.insideanalysis.com

Citation preview

Page 1: No Time-Outs: How to Empower Round-the-Clock Analytics
Page 2: No Time-Outs: How to Empower Round-the-Clock Analytics

Twitter Tag: #briefr

[email protected]

8/28/12

Page 3: No Time-Outs: How to Empower Round-the-Clock Analytics

Twitter Tag: #briefr

!  Reveal the essential characteristics of enterprise software, good and bad

!  Provide a forum for detailed analysis of today’s innovative technologies

!  Give vendors a chance to explain their product to savvy analysts

!  Allow audience members to pose serious questions... and get answers!

Page 4: No Time-Outs: How to Empower Round-the-Clock Analytics

Twitter Tag: #briefr

!  August: Analytics

!  September: Integration

!  October: Database

!  November: Cloud

!  December: Innovators

Page 5: No Time-Outs: How to Empower Round-the-Clock Analytics

Twitter Tag: #briefr

!  Analytics has always been about discovering insights that lead to better business decisions.

!  More organizations are demanding faster time-to-

insight, while at the same time expecting connectivity to and analytics on a wide variety of data and data sources.

!  Clever vendors look for ways to provide solutions that not only scale at lightening speeds, but deliver actionable insights.

Page 6: No Time-Outs: How to Empower Round-the-Clock Analytics

Twitter Tag: #briefr

Robin Bloor is Chief Analyst at

The Bloor Group.

[email protected]

Page 7: No Time-Outs: How to Empower Round-the-Clock Analytics

Twitter Tag: #briefr

Page 8: No Time-Outs: How to Empower Round-the-Clock Analytics

Twitter Tag: #briefr

Rick Sherman is the founder of Athena IT Solutions, a Massachussetts-based firm that provides business intelligence, data integration & data warehouse consulting, training and vendor services. In addition to having more than 20 years of experience in BI solutions, Rick writes on IT topics and is a frequent speaker at industry events. He blogs at The Data Doghouse and can be reached at: [email protected]

Page 9: No Time-Outs: How to Empower Round-the-Clock Analytics

Twitter Tag: #briefr

! Actian Corporation is a database and software development company.

!   Its premier database platform is Vectorwise, a highly performant analytic engine that implements parallelism at every level, from the processor core to data storage.

! Actian offers a cloud development platform for building Action Apps, lightweight applications that automate business actions triggered by real-time changes in data.

Page 10: No Time-Outs: How to Empower Round-the-Clock Analytics

Twitter Tag: #briefr

Fred Gallagher is GM, Vectorwise at Actian Corporation and is responsible for managing the business activities for this breakthrough product. He joined Actian in 2006 as vice president of business development.   Before joining Actian, Fred worked for Qlusters, where he was responsible for worldwide sales, marketing, and business development. At Qluster, he successfully launched the industry's first open source systems management project. Prior to that, he worked at VMWare, where he was responsible for worldwide software alliances, and where he established 15 successful strategic alliances during a high-growth period of two years. Previously he was at Seagate Technology, where he was vice president of worldwide channels and business development for Seagate's XIOtech subsidiary. Fred holds Bachelor of Arts and an MBA from Stanford University.

Page 11: No Time-Outs: How to Empower Round-the-Clock Analytics

Briefing Room

Speakers: Rick Sherman

Fred Gallagher, General Manager Vectorwise, Actian Corporation

August 28

Page 12: No Time-Outs: How to Empower Round-the-Clock Analytics

Actian Today

12

•  Global  Reach,  Growing  with  Strong  Balance  sheet  •  Highly  Profitable  with  strong  Cash  Balances  •  200  employees  across  11  Offices  

   

• World’s  Fastest    Big  Data  Analy5cal  Engine  •  Experiencing  RAPID  GROWTH  •  Affordable,  Leverages  Standard  Hardware  and  SoHware  

Vectorwise  

•  10,000+  mission  cri5cal  applica5ons  –  inc  Data  Warehouses    •  Very  high  client  sa5sfac5on  •  S5ll  Innova5ng:  GeoSpa5al  features  

Ingres  

•  Next  Genera5on  of  Ac5onable  BI  –  Connect  Insight  to  Ac5on    •  Analyzing  Events  and  Data    Ac5on  Apps  

Page 13: No Time-Outs: How to Empower Round-the-Clock Analytics

But  most  enterprises  s5ll  only  use  1-­‐5%  of  their  data  

What  if  that  number  doubled,  tripled,  quadrupled…  ?  Source:  Forrester  

Page 14: No Time-Outs: How to Empower Round-the-Clock Analytics

Enormous Opportunities for Big Data

14

$300bn  value  per  year   €250  bn  value  per  

year  $600  bn  value    per  year  

SOURCE:  McKinsey  Global  InsNtute  analysis  

Page 15: No Time-Outs: How to Empower Round-the-Clock Analytics

! Data silos, distributed and disparate

! Existing solutions are not real-time

! Expensive, inefficient or inflexible

! Scalability not there

“Big” Data Management Challenge

Local Data

Data in the Cloud

15

Page 16: No Time-Outs: How to Empower Round-the-Clock Analytics

The Need for Speed and Agility

16

! Business users require interactivity

!   Desktop users tolerate 10 to 20 seconds

!   Mobile users tolerate 2 to 5 seconds

! Increases in user concurrency

! Take action in “business time”

!  Be faster than your competition

!  Optimize your business

Page 17: No Time-Outs: How to Empower Round-the-Clock Analytics

Reduce Latency Between Events and Action

 

 

Value  

Time  latencies  

Latency  

Analyze  

A*ribu/on:  Jean-­‐Michel  Franco  of                                        Business  &  Decision  

Event  

Capture  

Latency  

Latency  

Ac5on  

Informed  decision  ready    to  be  made  

17

Page 18: No Time-Outs: How to Empower Round-the-Clock Analytics

0   100,000   200,000   300,000   400,000  

Vectorwise                                                                                                                                                                                                445,529  

Vectorwise                                                                                                                                                                                            436,788  

SQL  Server                                                        219,888  

Oracle                                                                  209,534  

Oracle                                                              201,487  

SQL  Server                            173,962  

Sybase  IQ                            164,747  

Oracle                          140,181  

SQL  Server    134,117  

June  ‘12  

May  ‘11  

Aug  ‘11  

June  ‘11  

Sept  ‘11  

Apr  ‘11  

Dec  ‘10  

Apr  ‘10  

Dec  ‘11  

QphH  

Fastest TPC-H QphH@1TB Benchmark (non-clustered) Source:  www.tpc.org  /  June  15,  2012  

Vectorwise: Affordable Performance – Proven!

18

Page 19: No Time-Outs: How to Empower Round-the-Clock Analytics

0   100,000   200,000   300,000   400,000  

Vectorwise                                                                                                                                                                                                445,529  

Vectorwise                                                                                                                                                                                            436,788  

SQL  Server                                                        219,888  

Oracle                                                                  209,534  

Oracle                                                              201,487  

SQL  Server                            173,962  

Sybase  IQ                            164,747  

Oracle                          140,181  

SQL  Server    134,117  

June  ‘12  

May  ‘11  

Aug  ‘11  

June  ‘11  

Sept  ‘11  

Apr  ‘11  

Dec  ‘10  

Apr  ‘10  

Dec  ‘11  

$57,146  

$1,229,968  

$460,869  

$2,402,706  

$753,392  

$278,527  

$85,621  

$1,249,967  

$258,880  

Hardware  Cost  (excluding  discounts)  QphH  

Fastest TPC-H QphH@1TB Benchmark (non-clustered) Source:  www.tpc.org  /  June  15,  2012  

Vectorwise: Affordable Performance – Proven!

19

Page 20: No Time-Outs: How to Empower Round-the-Clock Analytics

Vectorwise Technology

Confidential © 2012 Actian Corporation 20

Vector  Processing  Single  InstrucNonMulNple  Data  

2nd  Gen  Column  Store  Limit  I/O  Efficient  real  Nme  updates  

Smarter  Compression  

Maximize  throughput  Vectorized  decompression  

On  Chip  CompuNng  

Tim

e / C

ycle

s to

Pro

cess

Data Processed

DISK  

RAM  

CHIP  

10GB  2-­‐3GB  40-­‐400MB  

2-­‐20  

150-­‐250  

Millions  

Process  data  on  chip  –  not  in  RAM  

1  

2  

3  

4  

MulN-­‐core  Parallelism  

Maximize  system  resource  uNlizaNon

5  

…  

Page 21: No Time-Outs: How to Empower Round-the-Clock Analytics

Customer Stories: Sheetz and Zoho

!SaaS Company with 6 million Users ! Problem and Requirements

!   Customer data growing rapidly !   Ease of use for self-service BI !   Affordability !   200,000 users of Zoho Reports

Vectorwise results !   Exceptional performance !   Affordability for a SaaS offering

!Leader in Convenience Stores ! Problem

!   Multiple data sources

!   Need to control costs

!   Huge data growth

! Vectorwise results !   Expand data to analyze two years

!   Manage growth for three years

21

Page 22: No Time-Outs: How to Empower Round-the-Clock Analytics

Hadoop/Vectorwise Production Use Cases

Confidential © 2012 Actian Corporation 22

! NK !   Social media !   Optimize user experience and ad revenue

!   250 TB in Hadoop !   Extracting ~5 TB into Vectorwise

Page 23: No Time-Outs: How to Empower Round-the-Clock Analytics

Customer Stories: Badoo

!Fastest Growing Social Network ! Problem

!   Limited slice and dice analytics

!   Better target ad campaigns

!   Huge data growth

! Vectorwise results !   Detailed answers in seconds

!   Immediate actions

23

Page 24: No Time-Outs: How to Empower Round-the-Clock Analytics

Summary

24

! Successful businesses require speed and agility

! BI solutions must address these requirements

! Recommendations for how to get started and succeed:

!   Align IT goals and organization with user needs and business goals

!   Include operational processes in requirements (business and IT)

!   POC with Vectorwise for affordable performance and scalability

Page 25: No Time-Outs: How to Empower Round-the-Clock Analytics

For more information and to download a free trial of

Vectorwise visit: www.actian.com/vectorwise

Contact us at:

1 877-644-6343

[email protected]

Join the conversation:

Twitter: @Actiancorp #Vectorwise

Confidential © 2011 Actian Corporation 25

Page 26: No Time-Outs: How to Empower Round-the-Clock Analytics

Twitter Tag: #briefr

Page 27: No Time-Outs: How to Empower Round-the-Clock Analytics

Copyright © 2012 Athena IT Solutions

Rick  Sherman  Athena  IT  SoluNons  617-­‐835-­‐0546  (C)  rsherman@athena-­‐soluNons.com    

The  Changing  Face  of  Analy1cs:    How  to  Stay  Ahead  

 

The  Briefing  Room  

Page 28: No Time-Outs: How to Empower Round-the-Clock Analytics

Slide 28 Copyright © 2012 Athena IT Solutions All rights reserved.

The Changing Face of Analytics: How to Stay Ahead State of Data & Analytics

•  Data Exploding (sometimes Big Data) ü  Volume: ever accelerating data

û  90% of data in world created in last two years ü  Velocity: time-sensitive, real-time ü  Variety: structured & unstructured data such as:

text, audio, video, click streams, log files

•  Analytics has Expanded ü  Pre-built reports ü  Business Graphics, Trending, Drill-down to Details ü  Spreadsheets pervasive ü  Predictive Analytics ü  Data Visualization ü  Operational BI ü  Mobile BI ü  Self-Service BI

Page 29: No Time-Outs: How to Empower Round-the-Clock Analytics

Slide 29 Copyright © 2012 Athena IT Solutions All rights reserved.

The Changing Face of Analytics: How to Stay Ahead Response to BI Demand

•  Infrastructure ü  Software: Database, ETL, BI ü  Traditional servers dedicated to above ü  Storage: local, SAN, NAS

•  Data Architecture ü  Source Systems ü  ETL: daily or “near” real-time updates ü  DW & Data Marts (in Hub & Spoke) – Relational Database & OLAP ü  Many BI data stores created & tuned for specific analytics

•  Analytics Architecture ü  BI Suites: Dashboards, Reporting, Office Integration & Mobile ü  Data restructured for business processes & BI tool(s) ü  OLAP (On-Line Analytical Processing) for Power Users ü  Spreadsheets primary BI tool for business people

Page 30: No Time-Outs: How to Empower Round-the-Clock Analytics

Slide 30 Copyright © 2012 Athena IT Solutions All rights reserved.

The Changing Face of Analytics: How to Stay Ahead State of Business Analytics & BI Today

Page 31: No Time-Outs: How to Empower Round-the-Clock Analytics

Slide 31 Copyright © 2012 Athena IT Solutions All rights reserved.

The Changing Face of Analytics: How to Stay Ahead Need to Change Approach

•  Emerging Technologies have concentrated on: ü Data Variety ü  Complexity of Data & Analytics ü  Skills Shortfall

•  Emerging Technologies ü  Columnar Databases ü Massively Parallel Processing (MPP) ü Data Virtualization ü  In-Database Analytics ü  In-Memory Analytics ü  BI Analytical Appliances ü  Cloud-based Applications

Page 32: No Time-Outs: How to Empower Round-the-Clock Analytics

Slide 32 Copyright © 2012 Athena IT Solutions All rights reserved.

The Changing Face of Analytics: How to Stay Ahead BI Analytical Appliances

•  Appliances have evolved ü  Special purpose “devices” ü  Target DW, BI/Analytics & Database Processing

•  Architectures Vary Widely ü  Hardware & Software vs Software Only ü  Hardware:

û  Commodity vs Proprietary û  Commodity with selected specialized components

ü  Software û  Proprietary versus Open Source û  Open to selected DB, ETL and/or BI partners

ü  Emerging Technologies û  MPP, Columnar, In-Database Analytics, In-Memory Analytics & Cloud

Appliances •  Benefits: Lower TCO, Reliability, Scalable, Extensible

Page 33: No Time-Outs: How to Empower Round-the-Clock Analytics

Slide 33 Copyright © 2012 Athena IT Solutions All rights reserved.

The Changing Face of Analytics: How to Stay Ahead Speaker’s Expertise & Experience

•  Experience ü  25 years of DW & BI experience ü  30 years relational database experience ü  Consulting, IT and Software Engineering

•  Consulting ü  Business & IT Groups ü  Software Vendors

•  Instructor ü  Northeastern University, Graduate School of Engineering ü  DW & BI Conferences; DW & BI Courses

•  Writer, Columnist, Blogger ü  200+ Published Articles ü  White papers, Webinars, Podcasts & Seminars ü  DataDoghouse.com Blog on BI/DW industry

•  Thought Leadership: ü  TDWI – Boston User Group Officer ü  Boulder BI Brain Trust

Page 34: No Time-Outs: How to Empower Round-the-Clock Analytics

•  The query patterns for BI business analytics are much different than transactional processing. What are the key differences? How do you address them?

•  Traditionally BI implementations required a sophisticated data architecture including a DW, data marts (dimensional), OLAP, “flattened” datasets, aggregated/summarized tables and other reporting data stores. Also maybe an ODS (operational data store), staging tables and various data shadow systems. Do you reduce the complexity of the traditional data architecture?

•  A key component of developing the business analytics is to define what data the business needs, how they plan to analyze on it, design the queries, tune the database, etc. And then do it again for each query. How do you change that?

•  Business analytics typically involves a variety of BI tools such as reports, dashboards, scorecards, ad-hoc analytics, data visualization, data discovery and predictive analytics. How do you interact with these tools?

Twitter Tag: #briefr

Page 35: No Time-Outs: How to Empower Round-the-Clock Analytics

Twitter Tag: #briefr

•  Business analytics/BI, data integration and DW requires a lot of varied skills. What type of skills are needed to successfully implement your solutions? Do you raise the increase on skills needed for implementation?

•  The limiting factor on many enterprise-wide BI and DW programs has been cost, but emerging technology is perceived as more expensive. How do you lower the TCO?

•  Assume that most enterprises have a DW (maybe even MDM) in order to enable consistent and conformed data. How does your solution leverage the DW? Does you solution lessen the need for a DW?

•  There was a lot of hype regarding BI Appliances a while ago and many vendors used that term to label various hardware & software combinations. From the hype, what has emerged to impact BI & how? What are the “pretender” technologies that have not fulfilled on hype (no vendor names!)?

Page 36: No Time-Outs: How to Empower Round-the-Clock Analytics

Twitter Tag: #briefr

Page 37: No Time-Outs: How to Empower Round-the-Clock Analytics

Twitter Tag: #briefr

!  August: Analytics

!  September: Integration

!  October: Database

!  November: Cloud

!  December: Innovators

Page 38: No Time-Outs: How to Empower Round-the-Clock Analytics

Twitter Tag: #briefr