IM Forum 2013 PureData v2.0 FINAL.ppt


  • 8/11/2019 IM Forum 2013 PureData v2.0 FINAL.ppt


    IBM Information Management Forum, 11. September 2013, Wien

    PureData System for Hadoop

    Big Data Appliance

    Dipl.-Ing. Wolfgang Nimführ

    Business Development Executive, IBM Software Group, Big Data

    © 2013 International Business Machines Corporation

    Every company has a Big Data & Analytics Opportunity

    The power of Data coming together
    With the power of Technology
    To deliver Improved Outcomes

    * Enrich your information base with Big Data Exploration
    * Prevent crime with Security and Intelligence Extension
    * Optimize operations with Operations Analysis
    * Gain IT efficiency and scale with Data Warehouse Augmentation
    * Improve customer interaction with Enhanced 360° View of the Customer

    To generate Business Value


    How do you harness the elephant in the room?


    Let's simplify Big Data ...

    From custom and complex ... to organized simplicity

    Components: Hive, Pig, MapReduce, HDFS, HCatalog, Visualization, Development Tools

    Designed to:

    * Simplify the building, deploying and management of a Hadoop cluster
    * Speed the time-to-value for Hadoop and unstructured data
    * Maximize the overall analytic ecosystem
    * Provide enterprise security and platform management

    *Based on IBM internal testing and customer feedback. "Custom built clusters" refer to clusters that are not professionally pre-built, pre-tested and optimized. Individual results may vary.


    Announcing the new PureData System for Hadoop

    * Accelerate time to value
    * Accelerate time to insight
    * Simplify big data adoption and consumption
    * Extend the value of the data warehouse
    * Implement enterprise class big data
    * Minimize system setup and administration

    System for Hadoop


    IBM PureData System for Hadoop

    Accelerate Hadoop analytics with appliance simplicity

    Speed

    * Speed to insight with built-in analytics
    * Speed to value with accelerated deployment

    Simplicity

    * Ready to load data in hours
    * Integrated system management
    * Appliance approach reduces complexity
    * Single point of support

    Smart

    * Establish a cost efficient online data archive
    * Easily leverage data across the big data platform
    * Enterprise security, governance and high availability

    System for Hadoop


    Benefits of IBM PureData System for Hadoop

    * Deploy 8x faster than custom-built solutions*
    * Built-in visualization to accelerate insight
    * Built-in analytic accelerators, unlike big data appliances on the market
    * Single system console for full system administration
    * Rapid maintenance updates with automation
    * No assembly re


    InfoSphere BigInsights

    Entry Point for Hadoop


    BigInsights Value: Above and Beyond Hadoop

    Analytic Accelerators

    * Social Media Accelerator
    * Machine Data Accelerator
    * BigSheets spreadsheet and visualization
    * Advanced Text Analytics Accelerator
    * JAQL query language

    Performance and Optimization

    * Adaptive MapReduce
    * Advanced Scheduler
    * BigIndex for large scale indexing
    * Fast, splittable compression

    Security

    * Role based authorization

    Optim Development Studio

    * Eclipse based IDE for Java

    Big Data Integration

    * Information Server, InfoSphere Streams, Netezza, DB2

    Enterprise Enablement

    * Big SQL
    * GPFS-FPO

    IBM's distribution is based on Apache Hadoop and utilizes many of the capabilities included in that distribution, but IBM is focused on making its distribution more of an enterprise-class offering.


    Key enterprise capabilities on top of an unmodified open source foundation

    Additional enterprise capabilities

    * Administration & Security
    * Workload Optimization
    * Integrated Development Environment
    * Connectors (Data, Analytics, Integration)
    * Advanced Text Analytics Engine
    * Visualization & Exploration

    Open source components

    * IBM-certified Apache Hadoop


    BigInsights 2.1 features a variety of enhancements that deliver key Enterprise Hadoop capabilities

    High Availability

    * Out of the box High Availability
    * Seamless, automatic and transparent failover for HDFS NameNode
    * Eliminates admin intervention
    * Reduces downtime for recovery of the cluster
    * Hardware fencing to guarantee data integrity
    * No single point of failure

    GPFS-FPO support

    * Built-in High Availability
    * POSIX compliance
    * Enhanced Security with ACL support
    * Support for Storage Pools
    * SnapShot capability

    Big SQL

    * Comprehensive Standard ANSI SQL support to access data stored in BigInsights
    * Standards compliant JDBC and ODBC drivers
    * Leverages MapReduce parallelism in complex data sets
    * Direct access for low-latency in small queries, e.g. sub-second response to HBase queries

    Cognos v10, ...


    BigInsights Enterprise Edition 2.1

    Connectivity and Integration: Streams, Netezza, Text processing engine and library, JDBC, Flume

    Infrastructure: Jaql, Hive, Pig, HBase, MapReduce, HDFS, ZooKeeper, Indexing, Lucene, Adaptive MapReduce, Oozie, Text compression, Enhanced security, Flexible scheduler

    Optional IBM and partner offerings

    Analytics and discovery Apps: BigSheets, Web Crawler, Distrib file copy, DB export, Boardreader, DB import, Ad hoc query, Machine learning, Data processing, ...

    Administrative and development tools

    * Web console
      * Monitor cluster health, jobs, etc.
      * Add / remove nodes
      * Start / stop services
      * Inspect job status
      * Inspect workflow status
      * Deploy applications
      * Launch apps / jobs
      * Work with distrib file system
      * Work with spreadsheet interface
      * Support REST-based API
      * ...
    * Eclipse tools
      * Text analytics
      * MapReduce programming
      * Jaql, Hive, Pig development
      * BigSheets plug-in development
      * Oozie workflow generation

    Integrated installer

    Legend: Open Source, IBM

    Further components shown: DB2, Cognos BI, Big SQL, Accelerator for machine data analysis, Accelerator for social data analysis, Guardium, DataStage, Data Explorer, Sqoop, HCatalog, GPFS FPO


    IBM Big Data & Analytics Reference Architecture

    * All Data Sources
    * Information Ingestion and Operational Information: Stream Processing, Data Integration, Master Data
    * Landing Area, Analytics Zone and Archive: Raw Data, Structured Data, Text Analytics, Data Mining, Entity Analytics, Machine Learning
    * Exploration, Integrated Warehouse, and Mart Zones: Discovery, Deep Reflection, Operational, Predictive
    * Real-time Analytic Zone: Video/Audio, Network/Sensor, Entity Analytics, Predictive
    * Streams
    * Case Management, Analytics Applications, Alerts
    * Information Governance, Security and Business Continuity
    * Big Data Ecosystem Analytic Applications
    * Cloud Services
    * IBM Watson

    Analytics levels

    * Cognitive: Learn Dynamically?
    * Prescriptive: Best Outcomes?
    * Predictive: What Could Happen?
    * Descriptive: What Has Happened?
    * Exploration and Discovery: What Do You Have?

    PureData System for Hadoop: Active Archive, Big Data Exploration, Pre-Processing Hub


    Use case: Big Data Exploration

    Use Cases

    * Explore new data and previously untapped sources
    * Visualize and gain new insight with easy to use spreadsheet-style analysis
    * Identify useful information that would add value when integrated
    * Used for data profiling to understand data before moving to other systems


    Use case: Big Data Exploration

    Spreadsheet-style analysis process with BigSheets

    * Ad-hoc analytics for data scientists
    * Analyze a variety of data - unstructured and structured
    * Browser-based
    * Spreadsheet metaphor for exploring/visualizing data

    Gather, Extract, Explore, Iterate

    * Gather: Crawl (gather statistically); Adapter (gather dynamically)
    * Extract: Document-level info; Cleanse, normalize
    * Explore: Analyze, annotate, filter; Visualize results
    * Iterate: Iterate through any prior step
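
    The Gather, Extract, Explore, Iterate steps named on this slide can be illustrated with a minimal sketch in plain Python. This is not BigSheets or any BigInsights API; the sample documents, the function names and the keyword filter are all assumptions made up for illustration.

```python
from collections import Counter

# Hypothetical sample documents standing in for crawled or adapter-fed input.
RAW_DOCS = [
    "  Hadoop cluster DEPLOYED in hours ",
    "hadoop   archive for cold data",
    "",
    "Warehouse augmentation with Hadoop",
]

def gather(docs):
    """Gather: collect the non-empty raw documents."""
    return [d for d in docs if d.strip()]

def extract(docs):
    """Extract: cleanse and normalize each document into lowercase tokens."""
    return [d.lower().split() for d in docs]

def explore(token_lists, keyword):
    """Explore: keep documents containing the keyword and count their terms."""
    matching = [tokens for tokens in token_lists if keyword in tokens]
    counts = Counter(word for tokens in matching for word in tokens)
    return matching, counts

def run(docs, keywords):
    """Iterate: repeat the explore step with a different filter on each pass."""
    tokens = extract(gather(docs))
    return {kw: explore(tokens, kw)[1] for kw in keywords}

results = run(RAW_DOCS, ["hadoop", "archive"])
print(results["hadoop"].most_common(3))
print(results["archive"].most_common(3))
```

    Each pass reuses the same gathered and cleansed data and changes only the filter, which is the sense in which an analyst can return to any prior step.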


    Use case: Active Archive

    Use Cases

    * Immediate storage alternative of cold data
    * Cost savings for cold data
    * Compliance requirements
    * Simple analytics / exploration

    PureData System for Analytics

    PureData System for Hadoop


    Use case: Pre-Processing Hub

    Use Cases

    * Aggregation of data
    * Pre-process cleansing
    * Compliance requirements
    * Simple analytics / exploration


    PureData System for Hadoop

    Bringing Big Data to the enterprise

    * Simplify the delivery of unstructured data to the enterprise
    * Integrate Hadoop with the data warehouse
    * Leverage Hadoop for data archive
    * Provide best in class security
    * Provide data exploration across structured and unstructured data
    * Accelerate insight with machine data
    * Accelerate insight with social data

    Simplify Big Data for the enterprise


    IBM PureData System Family

    Meeting Big Data Challenges - Fast and Easy!

    * System for Transactions: for apps like E-commerce ... Database cluster services optimized for transactional throughput and scalability
    * System for Analytics: for apps like Customer Analysis ... Data warehouse services optimized for high-speed, peta-scale analytics and simplicity
    * System for Operational Analytics: for apps like Real-time Fraud Detection ... Operational data warehouse services optimized to balance high performance analytics and real-time operational throughput
    * System for Hadoop: for Exploratory Analysis & Queryable Archive. Hadoop data services optimized for big data analytics and online archive with appliance simplicity


    IBM Big Data & Analytics Reference Architecture

    * All Data Sources
    * Information Ingestion and Operational Information: Stream Processing, Data Integration, Master Data
    * Landing Area, Analytics Zone and Archive: Raw Data, Structured Data, Text Analytics, Data Mining, Entity Analytics, Machine Learning
    * Exploration, Integrated Warehouse, and Mart Zones: Discovery, Deep Reflection, Operational, Predictive
    * Real-time Analytic Zone: Video/Audio, Network/Sensor, Entity Analytics, Predictive
    * Streams
    * Case Management, Analytics Applications, Alerts
    * Information Governance, Security and Business Continuity
    * Big Data Ecosystem Analytic Applications
    * Cloud Services
    * IBM Watson

    Analytics levels

    * Cognitive: Learn Dynamically?
    * Prescriptive: Best Outcomes?
    * Predictive: What Could Happen?
    * Descriptive: What Has Happened?
    * Exploration and Discovery: What Do You Have?

    7%ise

    Data Explorer


    Part of the IBM Big Data Platform

    Workload Optimized Solutions for all your analytic needs

    * Solutions: Analytics & Decision Management
    * IBM Big Data Platform
      * Visualization & Discovery
      * Application Development
      * Systems Management
      * Accelerators
      * Stream Computing
      * Hadoop System
      * Data Warehouse
      * Information Integration & Governance
    * Big Data Infrastructure

    PureData System for Analytics

    PureData System for Hadoop
