23
www.anant.us | [email protected] | 202.905.2818 1010 Wisconsin Ave, NW | Suite 250 | Washington, DC 20007 Streamline Organize Unify Business Information Scaling Cloud Web & Data Technologies Distributed Platform: Spark, Cassandra, Kafka, Docker

Scaling Cloud Web & Data Technologies

Embed Size (px)

Citation preview

Page 1: Scaling Cloud Web & Data Technologies

www.anant.us | [email protected] | 202.905.28181010 Wisconsin Ave, NW | Suite 250 | Washington, DC 20007

Streamline Organize UnifyBusiness Information

Scaling Cloud Web & Data TechnologiesDistributed Platform:

Spark, Cassandra, Kafka, Docker

Page 2: Scaling Cloud Web & Data Technologies

• Who are these guys? (Intro)• Why Distribute Everything?• Spark: Distributed Computing• Cassandra: Distributed Data• Kafka: Distributed Queues• Docker: Simplified Code Distribution• Questions ?

Agenda

Page 3: Scaling Cloud Web & Data Technologies

Who am I. What I did. What I do.

Built a Hosting CompanyWeb Application Dev/HostingIT Consulting w/ Firms/Agencies……………………………………...……...Management Consulting for Internet Teams + Difficult Data

Page 4: Scaling Cloud Web & Data Technologies

What we do well.

Streamline, Organize & Unify

Business Information

Page 5: Scaling Cloud Web & Data Technologies

What is Business Information

Internal vs. External (Public vs. Private)Strategic / Tactical / OperationalData / Information / KnowledgeDocuments / Records / Metrics

Project

InformationClient Service

Information

Corporate

Guides

Collaborative

Documents

Assets

& Files

Corporate

Assets

Page 6: Scaling Cloud Web & Data Technologies

Phases of Business Technology

Business

Silos

Standardize

d Platform

Optimized

Core

Business

Modularity

Page 7: Scaling Cloud Web & Data Technologies

Your Business Cloud is Disconnected

… so you and your team spend waste time ...

Page 8: Scaling Cloud Web & Data Technologies

Bring it all together in One Place

Project

InformationClient Service

Information

Corporate

Guides

Collaborative

Documents

Assets

& Files

Corporate

Assets

Konotree

We will help you connect and organize your data.

Page 9: Scaling Cloud Web & Data Technologies

find. analyze. act.

FIND

DISCOVER

FILTER

ANALYZE

VISUALIZE

MEASURE

ACT

USE

SHARE

Page 10: Scaling Cloud Web & Data Technologies

Why Distribute Everything?

• Pros – Scale– Speed– Concurrency– Volume– Modularity

• Cons – Complexity– Latency– Parity

Page 11: Scaling Cloud Web & Data Technologies

Systems Amateurity

• Gonna host a ..:

– Project Management System ?

– Chat Server ?

– Frontend / Backend Framework ?

– Blog System / CMS / eCommerce ?

– Social Network ?

Page 12: Scaling Cloud Web & Data Technologies

Systems Amateurity

• Equivalent to restaurants making their own pots and pans

• Common in many teams - small to large

• “I can SELF-HOST EVERYTHING”

Page 13: Scaling Cloud Web & Data Technologies

Kafka

• 20-30 Times More Throughput

• PubSub or Message Queue

• Persistent w/ Offset (Replays)

• Messaging / ESB / Integration

• Seen in real world ~ 15TB / Day

Page 14: Scaling Cloud Web & Data Technologies

Spark

• Extremely Fast Processing

• Program in Java, Scala, Python, R, or C#

• Sophisticated Algorithms Available: PageRank in 5 lines.

• Real Time Stream Processing

• Awesome Integration with Hadoop or Other Data Sources

Page 15: Scaling Cloud Web & Data Technologies

Cassandra

• Massive Amounts of Data Quickly

• Distributed Architecture - Master - Master

• Tunable Consistency - CAP

• CQL ~ SQL - Familiar / Similar Syntax

• Easily Scale DB with Additional Nodes

• Easily add Replicas in additional DCs

• Hybrid of Schema / Schema-less

• Low Barrier to Entry

Page 16: Scaling Cloud Web & Data Technologies

Docker

• Derivable Images

• Immutable Containers

• Same Docker works Everywhere

• Same Compose works Everywhere

• Industry Wide Adoption

• Thousands of Docker Images Available

• Thousands of Compose Files Available

• Runs on Linux or Windows (Natively)

• Tested with * on Linux

• Tested with DotNet core on Linux• Tested with Legacy .NET/MSSQL on Windows

Page 17: Scaling Cloud Web & Data Technologies

Docker Makes it Easy to Self Host

Page 18: Scaling Cloud Web & Data Technologies

Docker Makes it Easy to Self Host

Page 19: Scaling Cloud Web & Data Technologies

Docker Makes it Easy to Self Host

Page 20: Scaling Cloud Web & Data Technologies

Appleseed Framework

Page 21: Scaling Cloud Web & Data Technologies

What we do well.

Streamline, Organize & Unify

Business Information

Page 22: Scaling Cloud Web & Data Technologies

Ongoing Subscription + Kickstart

KICKSTART*- Discovery*- Workshop- Architecture*- Review- Engineering*- Delivery

SUBSCRIPTION- Support- Advisory / Guidance- Maintenance*- Config / Customization

- Managed*- Setup / Train / Admin

*service optionsFree Consultation

Page 23: Scaling Cloud Web & Data Technologies

www.anant.us | [email protected] | 202.905.28181010 Wisconsin Ave, NW | Suite 250 | Washington, DC 20007

Streamlined Data

Integration / Data Pipelines

Organized Knowledge

Search / Data Warehouses

Unified Interfaces

Portals / Dashboards / Mobile

Free Consultation