Big Data Architecture for Enterprise

  • Published on
    17-Jul-2015

Transcript

  • Big Data Architecture for Enterprise

    Wei Zhang Big Data Architect

    Up up consultant, LLC

  • Design Principles

    Future-proof, scalable, and auto-recoverable; compatible with existing technologies; loosely coupled, layered architecture

  • Centralized Data Governance service

    Build a schema catalog service to track all data entities and attributes for both structured and unstructured data sets

    Establish and enforce proper practices, including solution patterns/design, coding, test automation, and release procedures
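
    As a rough, hypothetical sketch of what such a catalog service interface might look like (the type and method names below are illustrative, not from the deck):

      // Hypothetical catalog entry: one registered data entity and its attributes
      final case class EntityDescriptor(
        name: String,
        attributes: Map[String, String], // attribute name -> attribute type
        structured: Boolean              // false for unstructured data sets
      )

      // Hypothetical service interface for the centralized schema catalog
      trait SchemaCatalogService {
        def register(entity: EntityDescriptor): Unit
        def lookup(name: String): Option[EntityDescriptor]
        def listAll(): Seq[EntityDescriptor]
      }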

  • Logical Architecture

    Data Acquisition: text files, image files, XML files, EDI files, events

    Data Transformation and Storage

    Data Processing Pipeline: Hadoop (HDFS, MapReduce), Hive, Pig, Flume, Spark, Java/Scala

    NoSQL: MongoDB, Cassandra

    Relational databases: MS SQL Server, Oracle, MySQL

    Data Distribution: BI reports, text files, image files, XML files, EDI files, events
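
    As a rough illustration of how the acquisition, transformation, and storage layers could fit together, here is a minimal Spark sketch; the paths, column names, and transformation are hypothetical, not from the deck:

      import org.apache.spark.sql.SparkSession
      import org.apache.spark.sql.functions._

      object AcquisitionToStorageJob {
        def main(args: Array[String]): Unit = {
          val spark = SparkSession.builder()
            .appName("acquisition-to-storage") // hypothetical job name
            .getOrCreate()

          // Data Acquisition: raw delimited text files landed on HDFS
          val raw = spark.read
            .option("header", "true")
            .csv("hdfs:///data/acquisition/orders/") // hypothetical landing path

          // Data Transformation: cleanse and stamp the records (illustrative columns)
          val transformed = raw
            .filter(col("order_id").isNotNull)
            .withColumn("ingested_at", current_timestamp())

          // Storage: persist as Parquet for efficient downstream distribution
          transformed.write
            .mode("overwrite")
            .parquet("hdfs:///data/storage/orders_parquet/")

          spark.stop()
        }
      }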

  • Logical Architecture

    Data lifecycle control, access audit, replication, and DR

    On-disk and in-memory data processing technology stack: SQL or NoSQL databases, Hadoop MapReduce, Spark, or ETL tools, etc.

    Central data inventory services for discovery, tracking, and optimization

  • Technology Stack

    HDFS, MapReduce, YARN

    Oozie, Hive, Spark, Kafka, Cassandra, MongoDB

    BI & Reporting, Data acquisition and distribution, Data inventory and data model

  • Schema Catalog

    MongoDB schema store

    Schemas, entities, and attributes defined in Avro format

    Define all data sources and destinations, including format, transfer protocol, file system, schedule, etc.
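
    For illustration, a hedged sketch of how a catalog entry's schema could be expressed in Avro and loaded with the Avro Java API from Scala; the Customer record and its fields are invented for the example:

      import org.apache.avro.Schema

      // Illustrative Avro record definition for one catalog entry
      val customerSchemaJson =
        """{
          |  "type": "record",
          |  "name": "Customer",
          |  "namespace": "com.example.catalog",
          |  "fields": [
          |    {"name": "id", "type": "string"},
          |    {"name": "name", "type": "string"},
          |    {"name": "createdAt", "type": "long"}
          |  ]
          |}""".stripMargin

      val schema: Schema = new Schema.Parser().parse(customerSchemaJson)
      // The parsed schema, together with source/destination metadata such as
      // format, transfer protocol, and schedule, would be stored as a document
      // in the MongoDB schema store.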

  • Data Ledger

    Ledger inventory of all business data sets across the enterprise

    Data set producer and consumer registration

    Data sets are tagged and can be queried for traceability and usage
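
    A minimal, self-contained sketch of what a ledger entry with producer/consumer registration and tag queries might look like; the field names and the in-memory store are illustrative only:

      // Hypothetical ledger entry for one registered data set
      final case class DataSetEntry(
        name: String,
        producer: String,
        consumers: Set[String],
        tags: Set[String]
      )

      // Toy in-memory ledger; a real implementation would sit behind a service
      final class DataLedger(entries: Seq[DataSetEntry]) {
        // Traceability: every data set carrying a given tag
        def byTag(tag: String): Seq[DataSetEntry] =
          entries.filter(_.tags.contains(tag))

        // Usage: which data sets does a given consumer depend on?
        def consumedBy(consumer: String): Seq[DataSetEntry] =
          entries.filter(_.consumers.contains(consumer))
      }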

  • Data Processing and Persistence

    Relational databases for OLTP, data warehouse, and BI workloads that need to access SQL databases and existing systems

    HDFS for sources, destinations, staging, unstructured documents, and large-scale data processing; data saved in either Avro or Parquet format for better exchange and performance

    Cassandra for high-frequency, write-heavy transactional systems, and MongoDB for documents
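
    As one hedged example of the Cassandra path, the sketch below appends staged event data to a Cassandra table from Spark; it assumes the DataStax Spark Cassandra Connector is on the classpath, and the host, keyspace, table, and path names are hypothetical:

      import org.apache.spark.sql.SparkSession

      object EventsToCassandra {
        def main(args: Array[String]): Unit = {
          val spark = SparkSession.builder()
            .appName("events-to-cassandra")
            .config("spark.cassandra.connection.host", "cassandra-host") // hypothetical host
            .getOrCreate()

          // High-frequency event data already staged on HDFS as Parquet
          val events = spark.read.parquet("hdfs:///data/storage/events_parquet/")

          // Append into a Cassandra table sized for heavy write traffic
          events.write
            .format("org.apache.spark.sql.cassandra")
            .options(Map("keyspace" -> "analytics", "table" -> "events")) // hypothetical names
            .mode("append")
            .save()

          spark.stop()
        }
      }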

  • Automated and Regression Testing

    Maven, SBT, JUnit, ScalaTest
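
    A minimal ScalaTest sketch of the kind of unit test this tooling supports, assuming ScalaTest 3.x run via sbt; the Transforms object and its function are invented for the example:

      import org.scalatest.funsuite.AnyFunSuite

      // Hypothetical pure transformation step pulled out for unit testing
      object Transforms {
        def normalizeCountryCode(raw: String): String = raw.trim.toUpperCase
      }

      class TransformsSpec extends AnyFunSuite {
        test("normalizeCountryCode trims whitespace and upper-cases the value") {
          assert(Transforms.normalizeCountryCode(" us ") === "US")
        }
      }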

  • Physical Deployment

    Low end: 7.2K RPM / 75 IOPS disks, 16 cores, 128 GB RAM (data acquisition and distribution)

    Medium: 15K RPM / 175 IOPS disks, 24 cores, 512 GB RAM (batch processing)

    High end: 6K-500K IOPS, 80 cores, 1.5 TB RAM (real-time processing/analytics)
