10
Getting Started with Data Virtualization – What problems DV solves Richard Walker, VP of Enterprise Sales, Denodo

Getting Started with Data Virtualization – What problems DV solves

  • Upload
    denodo

  • View
    195

  • Download
    0

Embed Size (px)

Citation preview

Getting Started with Data Virtualization – What problems DV solves

Richard Walker, VP of Enterprise Sales, Denodo

2

Problem: IT Architecture is Unmanageable

TraditionalIssues

Hi-Data Growth,

IT Complexity, Data Silos, Hi - Latency

New Trends

Real Time,Big Data,

Unstructured Data,

External Data,Move to Cloud

Log files(.txt/.log files)

CRM(MySQL)

Billing System(Web Service - Rest)

Big Data, Cloud(Hadoop, Web)

Inventory System(MS SQL Server)

Product Catalog(Web Service -SOAP)

Customer Voice(Internet, Unstruc)

Product Data(CSV)

ETL

3

Solution: Data Virtualization (DV) = Middleware + DB Technology

Bus. Models - DV is the set of rules and mappings that converts Raw Data into Business Information and Links Info together.

Optimized Query Process -Queries are done in real time so that data is always current & eliminates data replication.

Ent. Governance & Security -Data is filtered based on roles & privileges.

4

4 Key Industry Trends Guides Denodo

• Discovery and self-service BI

• Access to data and metadata in business terms

• Canonical data models

• BigData• New data types• New sources:

Hadoop, Spark, etc.• In addition to

increased agility for traditional BI, needs, etc.

• SaaS• Data

Distribution• Multiple delivery

styles, formats• REST Services

• Logical data abstraction layer

• Lineage and change impact

• Metadata and data services sharing API

• Data services catalog

V6.0

5

Unlimited Connectivity to Any Data TypeRelational DB’s: Oracle, DB2, Sybase, MS SQL Server, MySQL, PostgreSQL, Informix, MS Access…

Parallel DB’s & Appliances: Teradata, Netezza, Oracle Exadata, Sybase IQ, Greenplum, ParAccel…

Multidimensional OLAP Engines: SAP BW, MS SQL Server Analysis Services, Mondrian, Essbase…

SOAP / REST Web Services and Data Feeds: XML, RSS, ATOM, JSON, Odata, Delimited Files – CSV, log files, device feeds, ...

Enterprise Applications: SAP R3 / ECC, Oracle E-Business suite,, Siebel, PeopleSoft, SAS...

Content Management Sys (CMS): MS SharePoint, IBM FileNet, Documentum…

Modeling Tools: Erwin, Rochade, ER Studio…

MDM & Mapping: IBM Initiate, ontologies, taxonomies…

Mainframe / Legacy Connectivity: Adabas, IMS, DB2, TN5250 / TN3270.

Plug-in architecture: third party Mainframe / Legacy Adapters...

Semantic repositories in Triple Stores / RDF accessed via SPARQL endpoints

LDAP and Active Directory: as source data & security access

Big Data / NoSQL: Hadoop, Hive, HCatalog, Impala, Scoop, HBase, PIG, HDFS, MapReduce, AVRO, HDFS, Mongo DB, CouchDB, Neo4J, Cassandra, MarkLogic…

Cloud, SaaS: Salesforce, Google, Amazon, LinkedIn, Facebook, Twitter via APIs; Any Website, Form, any Web based Apps…

Enterprise Service Bus: JMS message queues, WebSphere MQ, Sonic, ActiveMQ…

Custom Connector SDK: access any application via API and procedural interfaces.

Semi-Structured Data: Web sites, Forms, applications, PDF, MS Word, MS Excel

Unstructured Data: websites, file systems, Email servers, databases, knowledgebase, indexes (Lucene, MS FAST, HP Autonomy…), RSS Feeds …

6

Unified Data Layer in the 3 tier Architecture

User interface

Business logic

Data access/data infrastructure

SOA-based services

Data Virtualization

SOA/ESB

On-premise Cloud External

Transaction Data Analytics

Logical Business

Views

Pervasive user

interactionsWeb Mobile Pervasive

Web

7

Complementary Data Integration Architectures

On Demand - DB to AppData Federation from multiple sources

Event Based - App to AppMessage-based, transaction processing

Scheduled – DB to DBBulk data integration

DV EAI/ESB ETL

DV DV

Target/

Data WH

ESB

ESB

8

DV Platform: Business Drivers

Accelerate Project Timelines

Abstraction Layer: De-couple Applications from Enterprise Data – Protect daily operations

Deploy Faster

Use Less Resources

Unified Access to disparate data sources

Minimize data replication, less storage, no ETL

Reuse Canonical Model for multiple projects

Easy Big Data/NoSQL data integration with enterprise data sources

Operational ‘Real-time’ Reporting

360 Degree View of all business data entities

Offer secure business views filtering data by role and privileges

Real time data access, read & write

Support for operational applications

Business Controlled Self Service & Data Discovery

Ad hoc query, browse, and search for Business Users in a secure way

Data presented in a ‘business friendly’ informational format

Drill down through all business associations via links & subscriptions

9

Denodo ROI

Customer Reported Project Savings by Percentage

• Data Integration Cost Reduction

• 60% - 67% Savings

• Traditional, Call Centers, Portals

• 30-70% Savings

• BI & Reporting

• 40-60% Savings

• ETL & DW

• Project timelines reduced from 6-12 months to 3-

6 weeks

• 85% Time Reduction

Thanks!

www.denodo.com [email protected]

© Copyright Denodo Technologies. All rights reservedUnless otherwise specified, no part of this PDF file may be reproduced or utilized in any for or by any means, electronic or mechanical, including photocopying and microfilm, without prior the written authorization from Denodo Technologies.