28
1 © 2018 KPMG LLP, a Canadian limited liability partnership and a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (“KPMG International”), a Swiss entity. All rights reserved. The KPMG name and logo are registered trademarks or trademarks of KPMG International. 1 Hacking Life: with ML and AI - Guy Rosario, KPMG Cyber BC Security Day November 7 th 2018

Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

  • Upload
    others

  • View
    5

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

1 © 2018 KPMG LLP, a Canadian limited liability partnership and a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative

(“KPMG International”), a Swiss entity. All rights reserved. The KPMG name and logo are registered trademarks or trademarks of KPMG International.

1

Hacking Life: with ML and AI - Guy Rosario, KPMG Cyber

BC Security Day

November 7th 2018

Page 2: Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

2

Guy Rosario

Manager, KPMG

250 480 3608

[email protected]

Introducing

– Welcome

– What is it - TL;DR

– What’s the underlying problem

– Where do we go from here

– Wrap-up & questions

Agenda

© 2018 KPMG LLP, a Canadian limited liability partnership and a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (“KPMG International”), a Swiss entity. All rights reserved. The KPMG name and logo are registered trademarks or trademarks of KPMG International.

Page 3: Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

3

What is Personal data?

What is it?

TL;DR

A.I. is basically an umbrella term for it all

• The starting point is easy. Simply put, artificial intelligence

is a sub-field of computer science. Its goal is to enable the

development of computers that are able to do things

normally done by people -- in particular, things associated

with people acting intelligently. [source:Stanford]

• Stanford researcher John McCarthy coined the term in

1956

Page 4: Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

4

What is Personal data?

What is it?

TL;DR

Basically, any program can be considered AI if it does

something that we would normally think of as intelligent in

humans. How the program does it is not the issue, just that is

able to do it at all.

…That is, it is AI if it is smart, but it doesn’t

have to be smart like us…

Page 5: Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

5

What is Personal data?

What is it?

TL;DR

Cont’d

Page 6: Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

6

What is Personal data?

What is it?

TL;DR

So… Machine intelligence = A.I.

• Machine learning is also a blanket term

covering multiple technologies

• Neural networks are a type of machine

learning, and deep learning refers to one

particular kind

• Cognitive computing: It's complicated

Page 7: Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

7

What is Personal data?

Artificial

Intelligence

- The Good

Example Use Cases:

• Predicting ICU Transfers

• Medical Testing

• Fraud Mitigation

• Customer Retention

• Predictive Manufacturing Design

• Predictive Maintenance

Page 8: Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

8

What is Personal data?

Artificial

Intelligence

- The Good

Other applications of AI:

Page 9: Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

9

What is Personal data?

Artificial

Intelligence

- The Bad

It can be used to trick people…

• Fake News, Images, Video, Audio images

• …if you’re interested

• https://web.stanford.edu/~zollhoef/papers/SG2018_

DeepVideo/page.html

• https://arxiv.org/abs/1802.08195

Page 10: Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

10

What is Personal data?

Machine

Learning What can ML do?

• The Good

Page 11: Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

11

What is Personal data?

Machine

Learning The Bad

• Used in Dark Web in

Hivenets and

Swarmbots

• Used in phishing

schemes

• Attacking cloud

service providers

themselves

• Oh… they usually

don’t like us… (>_>)

Page 12: Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

12

What is Personal data?

Getting the

“gist” of AI If you REALLY wanna learn about it, here's

some free training:

• Learn TensorFlow and Deep Learning, Without a Ph.D.

• ColumbiaX Artificial Intelligence

• ColumbiaX MicroMaster Certificate in Artificial

Intelligence

• Machine Learning for Musicians and Artists

• MIT 6.803 The Human Intelligence Enterprise

• MIT 6.S094: Deep Learning for Self-Driving Cars

• MIT Course 6.034 Artificial Intelligence

• NPTEL Artificial Intelligence

Page 13: Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

13

What is Personal data?

What’s the

underlying

problem?

There is no well-defined strategy

If you’re going to remember anything from my talk, it

should be this… more on this at the end of the talk.

What are the challenges AI and ML can solve, if we have a

strategy:

• Dealing with too much data

• Slow response time once we detect things

• Lack of visibility

Page 14: Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

14

What is Personal data?

Where do

we go from

here?

What should we all focus on moving forward:

• We need to use AI and ML to get visibility

• We need to understand the world of the

possible

• We need to address the skills shortage via

AI and ML

• We need to revisit strategy and process

Page 15: Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

15

What is Personal data?

Where do

we go from

here?

Instead of AI, we could use smart humans, by

training them:

• Instead of searching logs like this… • time ls somelog* | while read i

• Consider that you could split a 1GB log file super fast,

and then parse it parallel: • split -a 2 -d -l 2000000 some.log somelog-20180113.spl $ ls –

al somelog-20180113.spl?? | head -3

• And search it like this: • ls somelog* | xargs -P 64 -L 8 xzcat | grep

superbadwebsite.org | wc –c

Page 16: Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

16

What is Personal data?

Where do

we go from

here?

How does this relate to cyber security?

1. We’re running out of people…

2. No visibility into our environments, data,

systems, etc. – to understand how we could be

hurt

3. We’re not training our tools to how to be

suspicious to understand what is abnormal

behavior

Page 17: Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

17

What is Personal data?

Where do

we go from

here?

As many of you here know, I’ve gotten past some multi-

million dollar “AI-based” security systems without getting

caught…

• Just say’n…( ͡º ͜ʖ ͡º)

“What is more visible, is more measurable –

and thus, can be improved.”

• “But, how?..” you say?.. “It’s like finding a

needle in a haystack?”

Page 18: Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

18

What is Personal data?

Where do

we go from

here?

“I burn the friggin’ hay and use one

hell of a magnet to find the needles!”

• KPMG uses ML toolkits to solve problems around:

• Network Traffic

• Host Data

• Logs

• Intelligence

• … and good ‘ole fashion process…

• And we’re going to teach people and technology…

<insert shameless plug here.>

Page 19: Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

19

What is Personal data?

Where do

we go from

here?

“I burn the friggin’ hay and use one hell of a

magnet to find the needles!”

• Network Analysis is automated, and can be scripted

• We can do file extraction and analysis

• Detect entropy

• Logging can filtered and categorized, and heuristics

done on the fly

• We can “fake an attacker” to validate if your “stuff”

does what it said it does.

• We can provide you with the controls to help fix your

stuff using industry standard formats.

Page 20: Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

20

What is Personal data?

Where do

we go from

here?

“I burn the friggin’ hay and use one hell of a

magnet to find the needles!”

• Oh… and we can do it using, on-premise or in the cloud and

open source, commercial, or a mix of the two…

• KPMG Lighthouse

• Center of Excellence for Data & Analytics: This center

enables the delivery of analytic capabilities seamlessly

across geographies and member firms to bring the right

services and talent to clients. KPMG Lighthouse provides

an integrated data and analytics platform that leverages

expertise in software and data engineering, data science,

advanced visualization, artificial intelligence and robotics.

Page 21: Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

21

What is Personal data?

Where do

we go from

here?

“I burn the friggin’ hay and

use one hell of a magnet to

find the needles!”

• And boy do we have

processes:

• ITIL

• COBIT

• ISO

• Etc.

Page 22: Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

22

What is Personal data?

Where do

we go from

here?

The key takeaways:

1. Train people and technology to “think like

a bad guy”

2. Anticipate the good and bad of AI

3. Have a strategy

Page 23: Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

23

Got questions?

Guy Rosario, ITIL

Manager, KPMG

(250) 480 3608

[email protected]

@GuyRosario

Contact us about AI and Machine Learning

© 2018 KPMG LLP, a Canadian limited liability partnership and a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative (“KPMG International”), a Swiss entity. All rights reserved. The KPMG name and logo are registered trademarks or trademarks of KPMG International.

Page 24: Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

Thank you

Page 25: Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

kpmg.ca

The information contained herein is of a general nature and is not intended to address the circumstances of any particular

individual or entity. Although we endeavour to provide accurate and timely information, there can be no guarantee that such

information is accurate as of the date it is received or that it will continue to be accurate in the future. No one should act on

such information without appropriate professional advice after a thorough examination of the particular situation.

© 2018 KPMG LLP, a Canadian limited liability partnership and a member firm of the KPMG network of independent member

firms affiliated with KPMG International Cooperative (“KPMG International”), a Swiss entity. 18291

All rights reserved. The KPMG name and logo are registered trademarks or trademarks of KPMG International.

Page 26: Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

26 © 2018 KPMG LLP, a Canadian limited liability partnership and a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative

(“KPMG International”), a Swiss entity. All rights reserved. The KPMG name and logo are registered trademarks or trademarks of KPMG International.

26

What is Personal data?

Learning

References

Branches of AI: • Common Sense Knowledge and Reasoning

• Epistemology

• Genetic Programming

• Heuristics

• Inference

• Learning from Experience

• Logical AI

• Ontology

• Pattern Recognition

• Planning

• Representation

• Search AI

AI can be used to do some “not-cool” things: • It can be used to trick people…

• Fake News

• Gov't Sponsored

• Private interests sponsored

• Half of Russian Twitter feeds were bots last year. - Oxford

Internet Institute.

Learning AI websites: • Association for the Advancement of Artificial Intelligence

• AWS SageMaker and DeepLens

• Azure Machine Learning

• Coursera

• Google Cloud Machine Learning Engine

• IBM Watson Website

• Kaggle (For testing your AI skillz)

• OpenAI

• Quora AI Feed

AI platforms to play with: • Azure Machine Learning Studio

• Deep Cognition

• Google Cloud Machine Learning Engine

• IBM Watson Studio

• Pega Platform

• TensorFlow

Conversational Intelligence Tools • Alexa for Business

• Drift

• Google Cloud Translation API

• Intercom

• TARS

• Zendesk Answer Bot

Deep Learning Platforms: • Amazon Lex

• Amazon Rekognition

• Dragon Speech Recognition Software

• Microfost Bing Speech API

• Microsoft Computer Vision API

• OpenCV

AI Tools, Libraries and Open Source: • Cognitive Network Toolkit (CNTK)

• Microsoft

• Runs on Linux and Microsoft

• Plays with Python!

• OpenAI Gym

• Deep Learning for Java

• Deeplearning4j

• Yahoo

• Deep Learning Framework

AI Tools, Libraries and Open Source: • CaffeOnSpark

• Voice, Image and Video recognition open sourced

• Caffe

• AI One

• DiffBlue

• InfoSys Nia

• Keras

• Nervana Neon

• OpenNN

• Protege

Page 27: Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

27 © 2018 KPMG LLP, a Canadian limited liability partnership and a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative

(“KPMG International”), a Swiss entity. All rights reserved. The KPMG name and logo are registered trademarks or trademarks of KPMG International.

27

What is Personal data?

Learning

References

AI Tools, Libraries and Open Source: • PyBrain

• Samsun's Veles

• Scikit-Learn

• Swift AI

• Theano

• Torch

… but the big players, (imo) • Amazon

• Amazon Lex

• Amazon Polly

• Amazon Rekognition Image

• Apache

• Apache Singe

• Microsoft

• Azure Machine Learning

• IBM Watson

If you REALLY wanna learn about it, here's some free training: • ColumbiaX Artificial Intelligence

• ColumbiaX MicroMaster Certificate in Artificial Intelligence

• Creative Applications of Deep Learning with TensorFlow

• Fast.ai 2: Cutting-Edge Deep Learning for Coders

• Fast.ai Practical Deep Learning for Coders

• GTx Machine Learning

• Intel Nervana AI Academy

• Khan Academy: Algorithms

• Learn TensorFlow and Deep Learning, Without a Ph.D.

• Learning from Data

• Machine Learning for Musicians and Artists

• MIT 6.803 The Human Intelligence Enterprise

• MIT 6.S094: Deep Learning for Self-Driving Cars

• MIT Course 6.034 Artificial Intelligence

• NPTEL Artificial Intelligence

• OpenSAP Enterprise Deep Learning with TensorFlow

• QUT Big Data: Statistical Inference and Machine Learning

• Saylor.org CS405: Artificial Intelligence

• Titanic: Machine Learning from Disaster

• UC Berkeley CS 294 Deep Reinforcement Learning

• UC Berkeley CS188 Intro to AI

• UCL COMPM050/COMPGI13: Reinforcement Learning

Not free, but good: • Artificial Intelligence A-Z: Learn How to Build an AI

• Artificial Intelligence: Reinforcement Learning in Python

• CMU Statistical Machine Learning

• Coursera Deep Learning Specialization

• Coursera Machine Learning

• Coursera Neural Networks for Machine Learning

• Coursera Practical Machine Learning

• Georgia Tech CS 6601: Artificial Intelligence • Microsoft Professional Program in Data Science

• Stanford University CS221: Artificial Intelligence Principles and Techniques

• Udacity Artificial Intelligence Engineer Nanodegree Program

• Udacity Machine Learning Engineer Nanodegree Program

• University of Oxford Deep Learning for Natural Language Processing

Great books that cover AI and ML: • How to Create a Mind

• Machine Learning and Security

• The Future of the Mind

Some of it's open source: • Facebook

• Torch

• SystemML

• IBM

• h20.ai

• Driverless AI

• Sparkling Water

• H2O AI Platform

• Open Neural Network Library

• OpenNN

• Oryx 2

• OpenCyc

• NuPIC

• LF Deep Learning Foundation

• Acumos AI

• Apache

• Apache Spark

• Machine Learning Library

• MLlib

• Apache Mahout

• Apache PredictionIO

Page 28: Hacking Life: with ML and AI - British Columbia · 2018-11-23 · • Amazon • Amazon Lex • Amazon Polly • Amazon Rekognition Image • Apache • Apache Singe • Microsoft

28 © 2018 KPMG LLP, a Canadian limited liability partnership and a member firm of the KPMG network of independent member firms affiliated with KPMG International Cooperative

(“KPMG International”), a Swiss entity. All rights reserved. The KPMG name and logo are registered trademarks or trademarks of KPMG International.

28

What is Personal data?

Learning

References

Machine Learning Software: • scikit-learn

• Machine-Learning in Python

• BigML

• Microsoft Bing Web Search API

• Microsoft Bing Image Search API

• Google Dialogflow Enterprise Edition

We don’t know what to do about it:

Online articles around AI, ML and the law: • Artificial Intelligence is Trade Policy's New Frontier

• An incredible amount of data flows over borders to power artificial intelligence — but

the rules have not been fully or publicly debated

• The Policy Deficit Behind Canadian Artificial Intelligence

• The longer Canada defers effective AI governance, the harder it will be to catch up

• Artificial Intelligence used for law

• Formal models of legal reasoning

• How the law views AI/ML

• There's laws for humans, what about something that thinks?