73
Machine Learning & Data Lake for IoT scenarios on AWS John Chang Technology Evangelist October 2016

Machine Learning & Data Lake for IoT scenarios on AWS

Embed Size (px)

Citation preview

Page 1: Machine Learning & Data Lake for IoT scenarios on AWS

Machine Learning & Data Lake for IoT scenarios on AWS

JohnChang

TechnologyEvangelistOctober2016

Page 2: Machine Learning & Data Lake for IoT scenarios on AWS

Three types of data-driven development

Retrospec)veanalysisandrepor<ng

AmazonRedshiA,AmazonRDSAmazonS3AmazonEMR

Page 3: Machine Learning & Data Lake for IoT scenarios on AWS

Three types of data-driven development

Retrospec)veanalysisandrepor<ng

Here-and-nowreal-<meprocessinganddashboards

AmazonKinesisAmazonEC2AWSLambda

AmazonRedshiA,AmazonRDSAmazonS3AmazonEMR

Page 4: Machine Learning & Data Lake for IoT scenarios on AWS

Three types of data-driven development

Retrospec)veanalysisandrepor<ng

Here-and-nowreal-<meprocessinganddashboards

Predic)onstoenablesmartapplica<ons

AmazonKinesisAmazonEC2AWSLambda

AmazonRedshiA,AmazonRDSAmazonS3AmazonEMR

Page 5: Machine Learning & Data Lake for IoT scenarios on AWS

Machine learning and smart applica@ons Machinelearningisthetechnologythatautoma<callyfindspaMernsinyourdataandusesthemtomakepredic<onsfornewdatapointsastheybecomeavailable.

Page 6: Machine Learning & Data Lake for IoT scenarios on AWS

Machine learning and smart applica@ons Machinelearningisthetechnologythatautoma<callyfindspaMernsinyourdataandusesthemtomakepredic<onsfornewdatapointsastheybecomeavailable.

Yourdata+machinelearning=smartapplica<ons

Page 7: Machine Learning & Data Lake for IoT scenarios on AWS

Smart applica@ons by example

Basedonwhatyouknowabouttheuser:Willtheyuseyourproduct?

Page 8: Machine Learning & Data Lake for IoT scenarios on AWS

Smart applica@ons by example

Basedonwhatyouknowabouttheuser:Willtheyuseyourproduct?

Basedonwhatyouknowaboutanorder:Isthisorderfraudulent?

Page 9: Machine Learning & Data Lake for IoT scenarios on AWS

Smart applica@ons by example

Basedonwhatyouknowabouttheuser:Willtheyuseyourproduct?

Basedonwhatyouknowaboutanorder:Isthisorderfraudulent?

Basedonwhatyouknowaboutanewsar)cle:Whatotherar)clesareinteres)ng?

Page 10: Machine Learning & Data Lake for IoT scenarios on AWS

And a few more examples… Frauddetec)on Detec<ngfraudulenttransac<ons,filteringspamemails,

flaggingsuspiciousreviews,…

Personaliza)on Recommendingcontent,predic<vecontentloading,improvinguserexperience,…

Targetedmarke)ng Matchingcustomersandoffers,choosingmarke<ngcampaigns,cross-sellingandup-selling,…

Contentclassifica)on Categorizingdocuments,matchinghiringmanagersandresumes,…

Churnpredic)on Findingcustomerswhoarelikelytostopusingtheservice,upgradetarge<ng,…

Customersupport Predic<verou<ngofcustomeremails,socialmedialistening,…

Page 11: Machine Learning & Data Lake for IoT scenarios on AWS

Smart applica@ons by counterexample

Dear Alex,

This awesome quadcopter is on sale for just $49.99!

Page 12: Machine Learning & Data Lake for IoT scenarios on AWS

Smart applica@ons by counterexample SELECT c.ID

FROM customers c

LEFT JOIN orders o

ON c.ID = o.customer

GROUP BY c.ID

HAVING o.date > GETDATE() – 30

Wecanstartbysendingtheoffertoallcustomerswhoplacedanorderinthelast30days

Page 13: Machine Learning & Data Lake for IoT scenarios on AWS

Smart applica@ons by counterexample SELECT c.ID

FROM customers c

LEFT JOIN orders o

ON c.ID = o.customer

GROUP BY c.ID

HAVING O.CATEGORY = ‘TOYS’

AND o.date > GETDATE() – 30

…let’snarrowitdowntojustcustomerswhoboughttoys

Page 14: Machine Learning & Data Lake for IoT scenarios on AWS

Smart applica@ons by counterexample SELECT c.IDFROM customers c LEFT JOIN orders o ON c.ID = o.customer

LEFT JOIN PRODUCTS P ON P.ID = O.PRODUCTGROUP BY c.IDHAVING o.category = ‘toys’ AND ((P.DESCRIPTION LIKE ‘%HELICOPTER%’ AND O.DATE > GETDATE() - 60) OR (COUNT(*) > 2 AND SUM(o.price) > 200 AND o.date > GETDATE() – 30) )

…andexpandthequerytocustomerswhopurchasedothertoyhelicoptersrecently,ormadeseveralexpensivetoypurchases

Page 15: Machine Learning & Data Lake for IoT scenarios on AWS

Smart applica@ons by counterexample SELECT c.ID

FROM customers c

LEFT JOIN orders o

ON c.ID = o.customer

LEFT JOIN products p

ON p.ID = o.product

GROUP BY c.ID

HAVING o.category = ‘toys’

AND ((p.description LIKE ‘%COPTER%’

AND o.date > GETDATE() - 60)

OR (COUNT(*) > 2

AND SUM(o.price) > 200

AND o.date > GETDATE() – 30)

)

…butwhataboutquadcopters?

Page 16: Machine Learning & Data Lake for IoT scenarios on AWS

Smart applica@ons by counterexample SELECT c.IDFROM customers c LEFT JOIN orders o ON c.ID = o.customer LEFT JOIN products p ON p.ID = o.productGROUP BY c.IDHAVING o.category = ‘toys’ AND ((p.description LIKE ‘%copter%’ AND o.date > GETDATE() - 120) OR (COUNT(*) > 2 AND SUM(o.price) > 200 AND o.date > GETDATE() – 30) )

…maybeweshouldgobackfurtherin<me

Page 17: Machine Learning & Data Lake for IoT scenarios on AWS

Smart applica@ons by counterexample SELECT c.ID

FROM customers c

LEFT JOIN orders o

ON c.ID = o.customer

LEFT JOIN products p

ON p.ID = o.product

GROUP BY c.ID

HAVING o.category = ‘toys’

AND ((p.description LIKE ‘%copter%’

AND o.date > GETDATE() - 120)

OR (COUNT(*) > 2

AND SUM(o.price) > 200

AND o.date > GETDATE() – 40)

)

…tweakthequerymore

Page 18: Machine Learning & Data Lake for IoT scenarios on AWS

Smart applica@ons by counterexample SELECT c.IDFROM customers c LEFT JOIN orders o ON c.ID = o.customer LEFT JOIN products p ON p.ID = o.productGROUP BY c.IDHAVING o.category = ‘toys’ AND ((p.description LIKE ‘%copter%’ AND o.date > GETDATE() - 120) OR (COUNT(*) > 2 AND SUM(o.price) > 150 AND o.date > GETDATE() – 40) )

…again

Page 19: Machine Learning & Data Lake for IoT scenarios on AWS

Smart applica@ons by counterexample SELECT c.ID

FROM customers c

LEFT JOIN orders o

ON c.ID = o.customer

LEFT JOIN products p

ON p.ID = o.product

GROUP BY c.ID

HAVING o.category = ‘toys’

AND ((p.description LIKE ‘%copter%’

AND o.date > GETDATE() - 90)

OR (COUNT(*) > 2

AND SUM(o.price) > 150

AND o.date > GETDATE() – 40)

)

…andagain

Page 20: Machine Learning & Data Lake for IoT scenarios on AWS

Smart applica@ons by counterexample SELECT c.ID

FROM customers c

LEFT JOIN orders o

ON c.ID = o.customer

LEFT JOIN products p

ON p.ID = o.product

GROUP BY c.ID

HAVING o.category = ‘toys’

AND ((p.description LIKE ‘%copter%’

AND o.date > GETDATE() - 90)

OR (COUNT(*) > 2

AND SUM(o.price) > 150

AND o.date > GETDATE() – 40)

)

Usemachinelearningtechnologytolearnyourbusinessrulesfromdata!

Page 21: Machine Learning & Data Lake for IoT scenarios on AWS

Why aren’t there more smart applica@ons? 1.  Machinelearningexper<seisrare.

2.  Buildingandscalingmachinelearningtechnologyishard.

3.  Closingthegapbetweenmodelsandapplica<onsis)me-consumingandexpensive.

Page 22: Machine Learning & Data Lake for IoT scenarios on AWS

Building smart applica@ons today Exper)se Technology Opera)onaliza)on

Limitedsupplyofdatascien<sts

Manychoices,fewmainstays Complexanderror-pronedataworkflows

Expensivetohireoroutsource

Difficulttouseandscale

CustompladormsandAPIs

Manymovingpiecesleadtocustomsolu<onsevery&me

Reinven<ngthemodellifecyclemanagementwheel

Page 23: Machine Learning & Data Lake for IoT scenarios on AWS

What if there were a beIer way?

Page 24: Machine Learning & Data Lake for IoT scenarios on AWS

Introducing Amazon Machine Learning Easy-to-use,managedmachinelearningservicebuiltfordevelopers

Robust,powerfulmachinelearningtechnologybasedonAmazon’sinternalsystems

CreatemodelsusingyourdataalreadystoredintheAWScloud

Deploymodelstoproduc<oninseconds

Page 25: Machine Learning & Data Lake for IoT scenarios on AWS

Easy-to-use and developer-friendly Usetheintui<ve,powerfulserviceconsoletobuildandexploreyourini<almodels

•  Dataretrieval•  Modeltraining,qualityevalua<on,fine-tuning•  Deploymentandmanagement

AutomatemodellifecyclewithfullyfeaturedAPIsandSDKs

•  Java,Python,.NET,JavaScript,Ruby,PHPEasilycreatesmartiOSandAndroidapplica<onswithAWSMobileSDK

Page 26: Machine Learning & Data Lake for IoT scenarios on AWS

Powerful machine learning technology BasedonAmazon’sbaMle-hardenedinternalsystems

Notjustthealgorithms:•  Smartdatatransforma<ons•  Inputdataandmodelqualityalerts•  Built-inindustrybestprac<ces

Growswithyourneeds

•  Trainonupto100GBofdata•  Generatebillionsofpredic<ons•  Obtainpredic<onsinbatchesorreal-<me

Page 27: Machine Learning & Data Lake for IoT scenarios on AWS

Integrated with the AWS data ecosystem AccessdatathatisstoredinAmazonS3,AmazonRedshiA,orMySQLdatabasesinAmazonRDSOutputpredic<onstoAmazonS3foreasyintegra<onwithyourdataflowsUseAWSIden<tyandAccessManagement(IAM)forfine-graineddataaccesspermissionpolicies

Page 28: Machine Learning & Data Lake for IoT scenarios on AWS

Fully-managed model and predic@on services End-to-endservice,withnoserverstoprovisionandmanage

One-clickproduc<onmodeldeployment

Programma<callyquerymodelmetadatatoenableautoma<cretrainingworkflows

Monitorpredic<onusagepaMernswithAmazonCloudWatchmetrics

Page 29: Machine Learning & Data Lake for IoT scenarios on AWS

Pay-as-you-go and inexpensive Dataanalysis,modeltraining,andevalua<on:$0.42/instancehour

Batchpredic<ons:$0.10/1000Real-<mepredic<ons:$0.10/1000+hourlycapacityreserva<oncharge

Page 30: Machine Learning & Data Lake for IoT scenarios on AWS

Three supported types of predic@ons Binaryclassifica<on

PredicttheanswertoaYes/Noques<on

Mul<classclassifica<onPredictthecorrectcategoryfromalist

Regression

Predictthevalueofanumericvariable

Page 31: Machine Learning & Data Lake for IoT scenarios on AWS

Train model

Evaluate and op@mize

Retrieve predic@ons

1 2 3

Building smart applica@ons with Amazon ML

Page 32: Machine Learning & Data Lake for IoT scenarios on AWS

Trainmodel

Evaluateandop<mize

Retrievepredic<ons

1 2 3

Building smart applica@ons with Amazon ML

-  Createadatasourceobjectpoin<ngtoyourdata-  Exploreandunderstandyourdata-  Transformdataandtrainyourmodel

Page 33: Machine Learning & Data Lake for IoT scenarios on AWS

Create a datasource object

>>> import boto

>>> ml = boto.connect_machinelearning()

>>> ds = ml.create_data_source_from_s3(

data_source_id = ’my_datasource',

data_spec = {

'DataLocationS3': 's3://bucket/input/data.csv',

'DataSchemaLocationS3': 's3://bucket/input/data.schema',

’compute_statistics’: True } )

Page 34: Machine Learning & Data Lake for IoT scenarios on AWS

Explore and understand your data

Page 35: Machine Learning & Data Lake for IoT scenarios on AWS

Train your model

>>> import boto

>>> ml = boto.connect_machinelearning()

>>> model = ml.create_ml_model(

ml_model_id = ’my_model',

ml_model_type = 'REGRESSION',

training_data_source_id = 'my_datasource')

Page 36: Machine Learning & Data Lake for IoT scenarios on AWS

Train model

Evaluate and optimize

Retrieve predictions

1 2 3

Building smart applica@ons with Amazon ML

-  Measureandunderstandmodelquality-  Adjustmodelinterpreta<on

Page 37: Machine Learning & Data Lake for IoT scenarios on AWS

Explore model quality

Page 38: Machine Learning & Data Lake for IoT scenarios on AWS

Fine-tune model interpreta@on

Page 39: Machine Learning & Data Lake for IoT scenarios on AWS

Fine-tune model interpreta@on

Page 40: Machine Learning & Data Lake for IoT scenarios on AWS

Train model

Evaluate and optimize

Retrieve predictions

1 2 3

Building smart applica@ons with Amazon ML

-  Batchpredic<ons-  Real-<mepredic<ons

Page 41: Machine Learning & Data Lake for IoT scenarios on AWS

Batch predic@ons Asynchronous,large-volumepredic<ongenera<onRequestthroughserviceconsoleorAPIBestforapplica<onsthatdealwithbatchesofdatarecords

>>> import boto

>>> ml = boto.connect_machinelearning()

>>> model = ml.create_batch_prediction(

batch_prediction_id = 'my_batch_prediction’,

batch_prediction_data_source_id = ’my_datasource’,

ml_model_id = ’my_model',

output_uri = 's3://examplebucket/output/’)

Page 42: Machine Learning & Data Lake for IoT scenarios on AWS

Real-@me predic@ons Synchronous,low-latency,high-throughputpredic<ongenera<onRequestthroughserviceAPI,server,ormobileSDKsBestforinterac<onapplica<onsthatdealwithindividualdatarecords

>>> import boto

>>> ml = boto.connect_machinelearning()

>>> ml.predict(

ml_model_id = ’my_model',

predict_endpoint = ’example_endpoint’,

record = {’key1':’value1’, ’key2':’value2’})

{

'Prediction': {

'predictedValue': 13.284348,

'details': {

'Algorithm': 'SGD',

'PredictiveModelType': 'REGRESSION’

}

}

}

Page 43: Machine Learning & Data Lake for IoT scenarios on AWS

Data Lake

Page 44: Machine Learning & Data Lake for IoT scenarios on AWS

Retailers need to deliver con@nuous differen@a@on

Personalization Merchandising Real-time engagement

Page 45: Machine Learning & Data Lake for IoT scenarios on AWS

Personalization Merchandising Real-time engagement

Retailers need to deliver con@nuous differen@a@on

Page 46: Machine Learning & Data Lake for IoT scenarios on AWS

Afull-serviceresiden<alrealestatebrokerage

Redfin manages data on hundreds of millions

of properties and millions of customers

The Hot Homes algorithm automatically calculates

the likelihood by analyzing more than 500 attributes of

each home

Was fully AWS-native since day one

https://aws.amazon.com/solutions/case-studies/redfin/

Page 47: Machine Learning & Data Lake for IoT scenarios on AWS

Hot Homes

There's an 80% chance this home will sell in the next 11 days – go tour it soon.

Page 48: Machine Learning & Data Lake for IoT scenarios on AWS

Ingest/Collect

Consume/visualizeStore Process/

analyze

Data1 40 9

5

AmazonS3Datalake AmazonEMR

AmazonKinesis

AmazonRedShiV

Answers&Insights

HotHomesUsers

Proper)es

Agents

User Profile Recommendation Hot Homes Similar Homes Agent Follow-up Agent Scorecard Marketing A/B Testing Real Time Data …

AmazonDynamoDB

BI/Repor)ng

Page 49: Machine Learning & Data Lake for IoT scenarios on AWS

Redfin Manages Data on Hundreds of Millions of Properties Using AWS

.

Once we solved the infrastructure problem, we could

dream a little bigger. Now we can deliver results without

worrying about how to scale.

Yong Huang, Director, Big Data and Analytics

“ •  Zero on-premises infrastructure

•  Using spot pricing for EC2, Redfin saved 90% compared to running on-demand

•  Using AWS, Redfin maintains a small technical team, allowing much simplified server management and allowing the transition to DevOps

•  Redfin is able to launch products like Hot Homes to greatly increase the buyer experience, by leveraging the agility and scale of AWS

Page 50: Machine Learning & Data Lake for IoT scenarios on AWS

Personalization Merchandising Real-time engagement

Retailers need to deliver con@nuous differen@a@on

Page 51: Machine Learning & Data Lake for IoT scenarios on AWS

American upscale fashion retailer

Nordstrom has 323 stores operating in 38 of the United States and also in Canada; the largest in number of

stores and geographic footprint

of its retail competitors

Fashion retailer that sells clothing, shoes, cosmetics,

and accessories

Nordstrom is going all in on AWS

https://aws.amazon.com/solutions/case-studies/nordstrom/

NORDSTROM

Page 52: Machine Learning & Data Lake for IoT scenarios on AWS
Page 53: Machine Learning & Data Lake for IoT scenarios on AWS

Ingest/ Collect

Consume/ visualize Store Process/

analyze

Data 1 4

0 9 5 Outcomes

& Insights

Personalized recommendations within seconds (from 15-20 min) Scale the expertise of stylists to all shoppers Reduce costs by 2X order of magnitude …

Mobile Users

Desktop Users

Analytics Tools

Online Stylist

Amazon RedShift

Amazon Kinesis

AWS Lambda

Amazon DynamoDB

AWS Lambda

AmazonS3DataStorage

NORDSTROM

Page 54: Machine Learning & Data Lake for IoT scenarios on AWS

Nordstrom gives personalized style recommendations in seconds

.

Alert me when the internet is down ...

Keith Homewood Cloud Product Owner, Nordstrom

“ •  Nordstrom Recommendation is the online version of a stylist. It can analyze and deliver personalized recommendations in seconds

•  Going All-In on AWS has resulted in reducing costs by 2X

•  Continuous delivery allows Nordstrom to deliver multiple production launches a day in a single application

•  Can now create a personalized recommendation in seconds, in what used to take 15-20 minutes of processing

•  Nordstrom Cloud Product Owner finds the reliability and availability of AWS so suitable that as long as the internet is working, Nordstrom Recommendation is working

Nordstrom

Page 55: Machine Learning & Data Lake for IoT scenarios on AWS

Personalization Merchandising Real-time engagement

Retailers need to deliver con@nuous differen@a@on

Page 56: Machine Learning & Data Lake for IoT scenarios on AWS

Technology that helps brick-and-mortar retailers optimize performance

Trusted by over 500 global brands in

45 countries worldwide and counting

Euclid analyzes customer movement data to

correlate traffic with marketing campaigns and to help retailers optimize

hours for peak traffic

Was fully AWS-native since day one

https://aws.amazon.com/solutions/case-studies/euclid/

Page 57: Machine Learning & Data Lake for IoT scenarios on AWS
Page 58: Machine Learning & Data Lake for IoT scenarios on AWS

Ingest/Collect

Consume/visualizeStore Process/

analyze

Data1 40 9

5

Answers&Insights

EuclidAnaly)cs

Campaigns

WiFi-Foottraffic

Transac)ons

Walk-Bys New & Return Visitors Visit Duration Engagement Rate Bounce Rate Storefront Potential & Conversion

Customer segmentation and loyalty assessment Regional and categorical roll-up reporting Zoning for large-format locations

EuclidEventIQAmazonS3Datalake

AmazonRDSforMySQL

AmazonEMR

AmazonRedShiV

AmazonEC2

AmazonElas)cBeanstalk

Elas)cLoadBalancing

Page 59: Machine Learning & Data Lake for IoT scenarios on AWS

AmazonRedshiA AmazonElas<cMapReduce

DataWarehouse Semi-structured

Amazon Glacier

Use an op@mal combina@on of highly interoperable services

AmazonSimpleStorageService

DataStorage Archive

AmazonDynamoDB

AmazonMachineLearning

AmazonKinesis

NoSQL Predic)veModels OtherAppsStreaming

Page 60: Machine Learning & Data Lake for IoT scenarios on AWS

AWS IoT

“SecurelyconnectoneoronebilliondevicestoAWS,sotheycaninteractwithapplica<onsandotherdevices”

Page 61: Machine Learning & Data Lake for IoT scenarios on AWS

AWS IoT

DEVICESDKSetofclientlibrariestoconnect,

authen<cateandexchangemessages

DEVICEGATEWAYCommunicatewithdevicesvia

MQTTandHTTP

AUTHENTICATIONAUTHORIZATIONSecurewithmutual

authen<ca<onandencryp<on

RULESENGINETransformmessages

basedonrulesandroutetoAWSServices

AWSServices-----

3PServices

DEVICESHADOWPersistentthingstateduringintermiMentconnec<ons

APPLICATIONS

AWSIoTAPI

DEVICEREGISTRYIden<tyandManagementof

yourthings

Page 62: Machine Learning & Data Lake for IoT scenarios on AWS

AWS IoT Rules Engine Ac@ons

RULESENGINETransformmessages

basedonrulesandroutetoAWSServices

AWSServices-----

3PServices

AWSServices-----

3PServices

Page 63: Machine Learning & Data Lake for IoT scenarios on AWS

1.AWSServices(DirectIntegra&on)

RulesEngine

Ac<ons

AWS IoT Rules Engine

AWSLambda

AmazonSNS

AmazonSQS

AmazonS3

AmazonKinesis

AmazonDynamoDB Amazon RDS

Amazon Redshift

Amazon Glacier

Amazon EC2

3.ExternalEndpoints(viaLambdaandSNS)

RulesEngineconnectsAWSIoTtoExternalEndpointsandAWSServices.

2.RestofAWS(viaAmazonKinesis,AWSLambda,AmazonS3,andmore)

Page 64: Machine Learning & Data Lake for IoT scenarios on AWS

AWS IoT Rules Engine Ac@ons

Rules Engine evaluates inbound messages published into AWS IoT, transforms and delivers to the appropriate endpoint based on business rules. External endpoints can be reached via Lambda and Simple Notification Service (SNS).

InvokeaLambdafunc<on

PutobjectinanS3bucket

Insert,Update,ReadfromaDynamoDBtablePublishtoanSNSTopicorEndpoint

PublishtoanAmazonKinesisstream

Ac<ons

AmazonKinesisFirehose

RepublishtoAWSIoT

AmazonMachineLearning

AmazonElas<csearch

Page 65: Machine Learning & Data Lake for IoT scenarios on AWS

AWS IoT Rules Engine & Amazon SNS

Push Notifications Apple APNS Endpoint, Google GCM Endpoint, Amazon ADM Endpoint, Windows WNS Amazon SNS -> HTTP Endpoint (Or SMS or Email) Call HTTP based 3rd party endpoints through SNS with subscription and retry support

SNS

2

Page 66: Machine Learning & Data Lake for IoT scenarios on AWS

AWS IoT Rules Engine for Machine Learning

Anomaly Detection Amazon Machine Learning can feed predictive evaluation criteria to the Rules Engine Continuous Improvement Around Predication Continuously look for outliers and re-calibrate the Amazon Machine Learning models

SendtoS3

AmazonMachineLearning

Re-Train

S3

Page 67: Machine Learning & Data Lake for IoT scenarios on AWS

The image cannot be displayed. Your computer may not have enough memory to open the image, or the image may have been corrupted. Restart your computer, and then open the file again. If the red x still appears, you may have to delete the image and then insert it again.

AWS IoT Rules Engine & Stream Data

N:1 Inbound Streams of Sensor Data (Signal to Noise Reduction) Rules Engine filters, transforms sensor data then sends aggregate to Amazon Kinesis Amazon Kinesis Streams to Enterprise Applications Simultaneously stream processed data to databases, applications, other AWS Services

OrderedStream

AmazonKinesis

Page 68: Machine Learning & Data Lake for IoT scenarios on AWS

Thankyou!

Page 69: Machine Learning & Data Lake for IoT scenarios on AWS

Getpredic<onswithAmazonMLbatchAPI

ProcessdatawithEMR

RawdatainS3Aggregateddata

inS3Predic)ons

inS3 Yourapplica)on

Batch predic@ons with EMR

Page 70: Machine Learning & Data Lake for IoT scenarios on AWS

Structured data In Amazon RedshiT

Load predic@ons into Amazon RedshiT

- or - Read predic@on results directly

from S3

Predic@ons in S3

Get predic@ons with Amazon ML batch API

Your applica@on

Batch predic@ons with Amazon RedshiT

Page 71: Machine Learning & Data Lake for IoT scenarios on AWS

Your applica@on

Get predic@ons with Amazon ML real-@me API

Amazon ML service

Real-@me predic@ons for interac@ve applica@ons

Page 72: Machine Learning & Data Lake for IoT scenarios on AWS

Your applica@on Amazon DynamoDB

+

Trigger events with Lambda +

Get predic@ons with Amazon ML real-@me API

Adding predic@ons to an exis@ng data flow

Page 73: Machine Learning & Data Lake for IoT scenarios on AWS

Recommenda@on engine

AmazonS3 AmazonRedshiA

AmazonML

DataCleansing

RawData

Trainmodel

BuildModels

S3Sta<cWebsite

Predic<ons