Bringing Data Analytics to the Edge

Page 1: Bringing Data Analytics to the Edge

© 2015 IBM Corporation

Bring Data Analytics to the Edge
Ton Machielsen - Customer Technical Professional
Cloud Data Services – IBM Digital Sales

Page 2: Bringing Data Analytics to the Edge

Cloudant – NoSQL Database as a Service

Powerful DBaaS

For apps that need:
• Elastic scalability
• High availability
• Data model flexibility
• Data mobility
• Text search
• Geospatial

Available as:
• Fully managed DBaaS
• On-premises private cloud
• Hybrid architecture

With custom coding that enables unique data needs to be developed in days, not weeks

Page 3: Bringing Data Analytics to the Edge

dashDB – Analytics Warehouse as a Service

For apps that need:
• Elastic scalability
• High availability
• Data model flexibility
• Data mobility
• Text search
• Geospatial

Available as:
• Fully managed DBaaS
• On-premises private cloud
• Hybrid architecture

Announced in October 2014

DB2 BLU columnar technology + Netezza in-database analytics

• Rapid deployment of large scale data warehouses

• Flexible options for both volume and processing speed

• Unified architecture that enables hybrid data processing on-premises & cloud

BLU Acceleration

Netezza In-Database Analytics

Cloudant Database as a Service

Page 4: Bringing Data Analytics to the Edge

IBM Watson Analytics
Self-service analytics capabilities in the cloud

• Fully Automated Intelligence
• Natural Language Dialogue
• Guided Analytic Discovery
• Single Analytics Experience

Page 5: Bringing Data Analytics to the Edge

Use Case / What we will see today

Management of a fictitious airline company, XYZ, notices a stagnation in growth.

Marketing is asked to conduct a Customer Satisfaction Survey to investigate which areas to improve.

1. Survey results are delivered to be analyzed

2. Survey data is imported into Cloudant for storage

3. Cloudant data is exported to dashDB for analysis

4. Watson Analytics is used to perform the analysis on the data

Page 6: Bringing Data Analytics to the Edge

Survey data delivered as CSV file

Page 7: Bringing Data Analytics to the Edge

CSV file gets imported into Cloudant

python csv-import.py -f <importfile.csv> -u <username> -d <database name>
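The slides do not show what csv-import.py does internally. As a minimal sketch of one plausible approach, the snippet below turns CSV rows into JSON payloads shaped for the CouchDB-style `_bulk_docs` endpoint that Cloudant exposes. The function name `rows_to_bulk_payloads`, the batch size, and the three-row sample are assumptions made for illustration; the HTTP POST to `/<database name>/_bulk_docs` and authentication are deliberately left out.

```python
import csv
import io
import json


def rows_to_bulk_payloads(csv_text, batch_size=500):
    """Yield JSON strings of the form {"docs": [...]}, one per batch of rows.

    Hypothetical helper: not taken from csv-import.py.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    batch = []
    for row in reader:
        batch.append(dict(row))  # one CSV row becomes one JSON document
        if len(batch) == batch_size:
            yield json.dumps({"docs": batch})
            batch = []
    if batch:  # flush the final, possibly smaller, batch
        yield json.dumps({"docs": batch})


# Made-up sample using column names from the survey data.
sample = (
    "Airline Code,Class,Satisfaction\n"
    "AA,Eco Plus,4.0\n"
    "AA,Eco,3.0\n"
    "AA,Business,5.0\n"
)
payloads = list(rows_to_bulk_payloads(sample, batch_size=2))
print(len(payloads))
print(payloads[0])
```

Each payload would be sent as the body of one POST request. Batching keeps individual requests small when the CSV file is large.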

Page 8: Bringing Data Analytics to the Edge

Sample JSON document in Cloudant

{
  "_id": "0940405adf4b22beab341d90e4000034",
  "_rev": "1-a727ce80b1e01d1e7b8365874d4f64a8",
  "Origin State": "Texas",
  "Airline Code": "AA",
  "Type of Travel": "Business travel",
  "Shopping Amount at Airport": "0",
  "Airline Name": "Paul Smith Airlines Inc. ",
  "Scheduled Departure Hour": "8",
  "No of Flights p.a.": "8",
  "Airline Status": "Blue",
  "Flight Distance": "1172",
  "Orgin City": "Dallas/Fort Worth, TX",
  "Price Sensitivity": "1",
  "Arrival Delay greater 5 Mins": "no",
  "Class": "Eco Plus",
  "Arrival Delay in Minutes": "0",
  "Gender": "Male",
  "Age": "40",
  "Flight cancelled": "No",
  "No of Flights p.a. grouped": "1 to 10",
  "% of Flight with other Airlines": "19",
  "Departure Delay in Minutes": "5",
  "Year of First Flight": "2003",
  "Flight date": "02-14-2014",
  "Age Range": "40-49",
  "Destination State": "Virginia",
  "No. of other Loyalty Cards": "2",
  "Satisfaction": "4.0",
  "Day of Month": "14",
  "Destination City": "Washington, DC",
  "Flight time in minutes": "140",
  "Eating and Drinking at Airport": "60"
}
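Every value in the sample document is a string, including numeric fields such as "Flight Distance" and "Satisfaction", which is what a plain CSV import would be expected to produce. If numeric types are wanted in the stored documents, one option is to coerce values before upload. The helper below is a hypothetical sketch of that idea and is not part of the import script shown in the slides.

```python
def coerce(value):
    """Return an int or float when the string parses cleanly, else the input unchanged."""
    if not isinstance(value, str):
        return value
    text = value.strip()
    for cast in (int, float):  # try the narrower type first
        try:
            return cast(text)
        except ValueError:
            continue
    return value


# A few fields copied from the sample document.
doc = {
    "Airline Code": "AA",
    "Flight Distance": "1172",
    "Satisfaction": "4.0",
    "Flight date": "02-14-2014",
}
typed = {key: coerce(val) for key, val in doc.items()}
print(typed)
```

Strings such as "inf" or "nan" also parse as floats in Python, so real survey data may need a stricter check than this.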

Page 9: Bringing Data Analytics to the Edge

Integration between Cloudant and dashDB

Create a data warehouse from the Cloudant dashboard.

The data warehouse is created automatically on IBM Bluemix, using an automatically generated dashDB instance.

The automatic Schema Discovery Process models the NoSQL database as a relational database structure, and the data is loaded into the dashDB database.

Page 10: Bringing Data Analytics to the Edge

Database structure in dashDB

Page 11: Bringing Data Analytics to the Edge

Migrating Cloudant JSON into dashDB

Cloudant’s Schema Discovery Process (SDP) translates JSON documents into a schema (or set of tables) that dashDB understands
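As a rough mental model of such a translation (not Cloudant's actual SDP implementation), the sketch below flattens one document: nested objects become prefixed, upper-cased columns, and arrays become child tables carrying the parent `_ID` and an `ARRAY_INDEX`. That naming mirrors the table listing on page 13. The function name `flatten`, the zero-based index, and the `VALUE` column for arrays of scalars are assumptions, and arrays nested inside array items are not handled carefully.

```python
def flatten(doc, base):
    """Split one JSON document into {table_name: [row, ...]}. Hypothetical sketch."""
    tables = {base: [{}]}
    parent = tables[base][0]

    def add_columns(row, prefix, value):
        for key, val in value.items():
            name = (prefix + "_" + key if prefix else key).upper()
            if isinstance(val, dict):
                # Nested objects stay on the same row, with a prefixed column name.
                add_columns(row, name, val)
            elif isinstance(val, list):
                # Arrays go to a child table keyed by the parent document id.
                child = tables.setdefault(base + "_" + name, [])
                for index, item in enumerate(val):
                    child_row = {"_ID": doc["_id"], "ARRAY_INDEX": index}
                    if isinstance(item, dict):
                        add_columns(child_row, "", item)
                    else:
                        child_row["VALUE"] = item  # assumed column for scalar items
                    child.append(child_row)
            else:
                row[name] = val

    add_columns(parent, "", doc)
    return tables


# Trimmed-down version of the lobbying sample document.
doc = {
    "_id": "019b716168d45be2c2bd8371d4000d5c",
    "Amount": "20000",
    "Client": {"ClientName": "CHESAPEAKE BAY FOUNDATION"},
    "Lobbyists": [
        {"LobbyistName": "BORSKI, ROBERT A JR"},
        {"LobbyistName": "TRUMBORE, JOHN MARK"},
    ],
}
tables = flatten(doc, "LOBBY-SEARCH")
for table_name, rows in tables.items():
    print(table_name, rows)
```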

Page 12: Bringing Data Analytics to the Edge

Sample JSON document in Cloudant

{
  "_id": "019b716168d45be2c2bd8371d4000d5c",
  "_rev": "1-4b4994f0dd1ddc56b96bd2a16cb080e7",
  "Received": "2010-10-18T11:36:10.057",
  "Period": "3rd Quarter (July 1 - Sep 30)",
  "Amount": "20000",
  "Client": {
    "ContactFullname": "JOHN MARK TRUMBORE",
    "ClientPPBCountry": "USA",
    "ClientID": "1000634",
    "GeneralDescription": "Non-profit organization dedicated to restoring the Chesapeake Bay and its tributary rivers",
    "ClientPPBState": "DISTRICT OF COLUMBIA",
    "ClientCountry": "USA",
    "IsStateOrLocalGov": "0",
    "ClientState": "DISTRICT OF COLUMBIA",
    "ClientName": "CHESAPEAKE BAY FOUNDATION",
    "ClientStatus": "1"
  },
  "GovernmentEntities": [
    {
      "GovEntityName": "HOUSE OF REPRESENTATIVES"
    }
  ],
  "Lobbyists": [
    {
      "LobbyistName": "BORSKI, ROBERT A JR",
      "LobbyisteIndicator": "0",
      "LobbyistStatus": "0",
      "OfficialPosition": "Member of Congress (1983-2003)"
    },
    {
      "LobbyistName": "TRUMBORE, JOHN MARK",
      "LobbyisteIndicator": "0",
      "LobbyistStatus": "0",
      "OfficialPosition": "Leg Asst (Rep. McHale), Leg Dir (Rep. R. Brady)"
    }
  ],
  "Year": "2010",
  "Type": "THIRD QUARTER REPORT",
  "Registrant": {
    "RegistrantPPBCountry": "USA",
    "RegistrantName": "Borski Associates",
    "RegistrantCountry": "USA",
    "RegistrantID": "84376",
    "GeneralDescription": "Government Relations Consulting",
    "Address": "4015 Fitler Street\r\nPhiladelphia, PA 19114"
  },
  "Issues": [
    {
      "Code": "ENVIRONMENT/SUPERFUND",
      "SpecificIssue": "Chesapeake Clean Water and Ecosystem Restoration Act (HR 3852)\nChesapeake Bay Program Reauthorization and Improvement Act (HR 5509)"
    }
  ]
}

Page 13: Bringing Data Analytics to the Edge

Database structure in dashDB

LOBBY-SEARCH
• _ID
• _REV
• AMOUNT
• CLIENT_CLIENTCOUNTRY
• CLIENT_CLIENTID
• CLIENT_CLIENTNAME
• CLIENT_CLIENTPPBCOUNTRY
• CLIENT_CLIENTPPDSTATE
. . . . . .

LOBBY-SEARCH_AFFILIATEDORGS
• ARRAY_INDEX
• AFFILIATEDORGCOUNTRY
• AFFILIATEDORGNAME
• AFFILIATEDORGPPBCCOUNTRY
• _ID

LOBBY-SEARCH_FOREIGNENTITIES
• ARRAY_INDEX
• FOREIGNENTITYCONTRIBUTION
• FOREIGNENTITYCOUNTRY
• FOREIGNENTITYNAME
• FOREIGNENTITYOWNERSHIPPERCENTAGE
• FOREIGNENTITYPPBCOUNTRY
• FOREIGNENTITYSTATUS
• _ID

LOBBY-SEARCH_ISSUES
• ARRAY_INDEX
• CODE
• SPECIFICISSUE
• _ID

LOBBY-SEARCH_OVERFLOW
• EXCEPTION
• WARNING
• _ID

LOBBY-SEARCH_GOVERNMENTENTITIES
• ARRAY_INDEX
• GOVENTITYNAME
• _ID

LOBBY-SEARCH_LOBBYISTS
• ARRAY_INDEX
• LOBBYISTINDICATOR
• LOBBYISTNAME
• LOBBYISTSTATUS
• OFFICIALPOSITION
• _ID
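The array tables listed above each carry `_ID` and `ARRAY_INDEX`, which suggests that child rows can be joined back to the base table on `_ID`. The sketch below illustrates that join. It uses an in-memory SQLite database purely as a stand-in for dashDB, creates only a subset of the listed columns, and assumes a zero-based `ARRAY_INDEX`; the values come from the sample lobbying document.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # stand-in for dashDB, for illustration only
conn.executescript('''
    CREATE TABLE "LOBBY-SEARCH" (
        "_ID" TEXT PRIMARY KEY, "AMOUNT" TEXT, "CLIENT_CLIENTNAME" TEXT
    );
    CREATE TABLE "LOBBY-SEARCH_LOBBYISTS" (
        "_ID" TEXT, "ARRAY_INDEX" INTEGER, "LOBBYISTNAME" TEXT
    );
''')

doc_id = "019b716168d45be2c2bd8371d4000d5c"
conn.execute(
    'INSERT INTO "LOBBY-SEARCH" VALUES (?, ?, ?)',
    (doc_id, "20000", "CHESAPEAKE BAY FOUNDATION"),
)
conn.executemany(
    'INSERT INTO "LOBBY-SEARCH_LOBBYISTS" VALUES (?, ?, ?)',
    [(doc_id, 0, "BORSKI, ROBERT A JR"), (doc_id, 1, "TRUMBORE, JOHN MARK")],
)

# Reassemble the lobbyists of each document by joining on the shared _ID.
rows = conn.execute('''
    SELECT p."CLIENT_CLIENTNAME", c."ARRAY_INDEX", c."LOBBYISTNAME"
    FROM "LOBBY-SEARCH" AS p
    JOIN "LOBBY-SEARCH_LOBBYISTS" AS c ON c."_ID" = p."_ID"
    ORDER BY c."ARRAY_INDEX"
''').fetchall()
for row in rows:
    print(row)
```

The table names contain a hyphen, so they are quoted as delimited identifiers in the SQL.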

Keys with array-values are stored in separate tables.

  "Lobbyists": [
    {
      "LobbyistName": "BORSKI, ROBERT A JR",
      "LobbyisteIndicator": "0",
      "LobbyistStatus": "0",
      "OfficialPosition": "Member of Congress (1983-2003)"
    },
    {
      "LobbyistName": "TRUMBORE, JOHN MARK",
      "LobbyisteIndicator": "0",
      "LobbyistStatus": "0",
      "OfficialPosition": "Leg Asst (Rep. McHale), Leg Dir (Rep. R. Brady)"
    }
  ],
  "Year": "2010",

Page 14: Bringing Data Analytics to the Edge

Watson Analytics connection into dashDB

Page 15: Bringing Data Analytics to the Edge

Datasets in Watson Analytics

Page 16: Bringing Data Analytics to the Edge

Analysis in Watson Analytics

Page 17: Bringing Data Analytics to the Edge

Analysis in Watson Analytics

Page 18: Bringing Data Analytics to the Edge

Information delivery from Watson Analytics

Page 19: Bringing Data Analytics to the Edge

Q&A and more info

Cloudant – http://www.cloudant.com
dashDB – http://www.dashdb.com
Watson Analytics – http://www.watsonanalytics.com
Bluemix – http://www.bluemix.net

Me! – http://ibm.biz/ton_machielsen

Try it out for free!
