40
Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins GROK - Numenta [email protected]

Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

Computing Like the Brain The Path To Machine Intelligence

Jeff Hawkins

GROK - Numenta [email protected]

Page 2: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

1) Discover operating principles of neocortex

2) Build systems based on these principles

Page 3: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

Alan Turing

“Computers are universal machines” 1935

“Human behavior as test for machine intelligence” 1950

Pros: - Good solutions Cons: - Task specific - Limited or no learning

Artificial Intelligence - no neuroscience

• MIT AI Lab • 5th Generation Computing Project • DARPA Strategic Computing Initiative • DARPA Grand Challenge

Major AI Initiatives

AI Projects

• ACT-R • Asimo • CoJACK • Cyc • Deep Blue • Global Workspace Theory • Mycin • SHRDLU • Soar • Watson - Many more -

Page 4: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

Artificial Neural Networks – minimal neuroscience

Warren McCulloch

Walter Pitts

“Neurons as logic gates” 1943

Proposed first artificial neural network

ANN techniques

Pros: - Good classifiers - Learning systems Cons: - Limited - Not brain like

• Back propagation • Boltzman machines • Hopfield networks • Kohonen networks • Parallel Distributed Processing • Machine learning • Deep Learning

Page 5: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

Whole Brain Simulator – maximal neuroscience

The Human Brain Project

No theory No attempt at Machine Intelligence

Blue Brain simulation

Page 6: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

1) Discover operating principles of neocortex

2) Build systems based on these principles

Anatomy, Physiology

Theory Software Silicon

Good progress is being made

1940s in computing = 2010s in machine intelligence

Page 7: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

The neocortex is a memory system.

data stream retina

cochlea

somatic

The neocortex learns a model from sensor data

- predictions - anomalies - actions

The neocortex learns a sensory-motor model of the world

Page 8: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

Principles of Neocortical Function

retina

cochlea

somatic

1) On-line learning from streaming data

data stream

Page 9: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

Principles of Neocortical Function

2) Hierarchy of memory regions

- regions are nearly identical

retina

cochlea

somatic

1) On-line learning from streaming data

data stream

Page 10: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

Principles of Neocortical Function

2) Hierarchy of memory regions

retina

cochlea

somatic

3) Sequence memory

- inference

- motor

data stream

1) On-line learning from streaming data

Page 11: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

Principles of Neocortical Function

4) Sparse Distributed Representations

2) Hierarchy of memory regions

retina

cochlea

somatic

3) Sequence memory

data stream

1) On-line learning from streaming data

Page 12: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

Principles of Neocortical Function

retina

cochlea

somatic

data stream

2) Hierarchy of memory regions

3) Sequence memory

5) All regions are sensory and motor

4) Sparse Distributed Representations

Motor

1) On-line learning from streaming data

Page 13: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

Principles of Neocortical Function

retina

cochlea

somatic

data stream

x x x

x x x

x x x x x x

x

2) Hierarchy of memory regions

3) Sequence memory

5) All regions are sensory and motor

6) Attention

4) Sparse Distributed Representations

1) On-line learning from streaming data

Page 14: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

Principles of Neocortical Function

4) Sparse Distributed Representations

2) Hierarchy of memory regions

retina

cochlea

somatic

3) Sequence memory

5) All regions are sensory and motor

6) Attention

data stream

1) On-line learning from streaming data

These six principles are necessary and sufficient for biological and

machine intelligence.

- All mammals from mouse to human have them

- We can build machines like this

Page 15: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

Sparse Distributed Representations (SDRs)

• Many bits (thousands)

• Few 1’s mostly 0’s

• Example: 2,000 bits, 2% active

• Each bit has semantic meaning (learned)

• Representation is semantic

01000000000000000001000000000000000000000000000000000010000…………01000

Dense Representations

• Few bits (8 to 128)

• All combinations of 1’s and 0’s

• Example: 8 bit ASCII

• Individual bits have no inherent meaning

• Representation is arbitrary

01101101 = m

Page 16: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

SDR Properties

1) Similarity: shared bits = semantic similarity

subsampling is OK

3) Union membership:

Indices 1 2 | 10

Is this SDR a member?

2) Store and Compare: store indices of active bits

Indices 1 2 3 4 5 | 40

1) 2) 3)

….

10)

2%

20%

Union

Page 17: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

Coincidence detectors

How does a layer of neurons learn sequences?

Sequence Memory (for inference and motor)

Page 18: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

Each cell is one bit in our Sparse Distributed Representation

SDRs are formed via a local competition between cells.

All processes are local across large sheets of cells.

Page 19: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

SDR (time =1)

Page 20: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

SDR (time =2)

Page 21: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

Cells connect to sample of previously active cells to predict their own future activity.

Page 22: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

Multiple Predictions Can Occur at Once.

This is a 1st order memory. We need a high order memory.

Page 23: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

High order sequences are enabled with multiple cells per column.

Page 24: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

High Order Sequence Memory

0 1 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 1 0 0 1 0 0 0 0…………0 1 0 0 0

40 active columns, 10 cells per column

= 1040 ways to represent the same input in different contexts

A-B-C-D-E

X-B’-C’-D’-Y

0 1 0 0 0 0 0 0 1 0 0 0 0 0 0 0 0 0 1 0 0 1 0 0 0 0…………0 1 0 0 0

Page 25: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

High Order Sequence Memory

Distributed sequence memory

High order, high capacity

Noise and fault tolerant Multiple simultaneous predictions Semantic generalization

Page 26: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

Online learning

• Learn continuously, no batch processing

• If pattern repeats, reinforce, otherwise forget it

Connection strength is binary

Connection permanence is a scalar

Training changes permanence

Learning is the growth of new synapses.

1 0

connected unconnected

Connection permanence 0.2

Page 27: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

“Cortical Learning Algorithm” (CLA)

Not your typical computer memory!

A building block for

- neocortex

- machine intelligence

Page 28: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

Feedforward inference

Feedforward inference

Motor output

Feedback / attention

2 m

m

2 m

m

sequence memory

sequence memory

sequence memory

sequence memory

CLA CLA CLA CLA

Evidence suggests each layer is implementing a CLA variant

Cortical Region

Page 29: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

1) Commercialization

- GROK: Predictive analytics using CLA

- Commercial value accelerates interest and investment

2) Open Source Project

- NuPIC: CLA open source software and community

- Improve algorithms, develop applications

3) Custom CLA Hardware

- Needed for scaling research and commercial applications

- IBM, Seagate, Sandia Labs, DARPA

What Is Next? Three Current Directions

Page 30: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

Fie

ld 1

Fie

ld 2

Fie

ld 3

Fie

ld N

Fie

ld 1

Fie

ld 2

Fie

ld 3

Fie

ld N

Fie

ld 1

Fie

ld 2

Fie

ld 3

Fie

ld N

numbers categories text

date

time

Sequence Memory

2,000 cortical columns 60,000 neurons - variable order - online learning

encoder

encoder

encoder

encoder

SDRs Predictions Anomalies

GROK: Predictive Analytics Using CLA

Encoders

Convert native data type to SDRs

CLA

Learns spatial/temporal patterns Outputs - predictions anomalies

Actions

Page 31: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

GROK example: Factory Energy Usage

Page 32: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

Customer need

At midnight, make 24 hourly predictions

Page 33: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

GROK Predictions and Actuals

Page 34: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

GROK example: Predicting Server Demand

Date

Actual Predicted

Server demand, Actual vs. Predicted

Grok used to predict server demand

Approximately 15% reduction in AWS cost

Page 35: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

GROK example: Detecting Anomalous Behavior

Grok builds model of data, detects changes in predictability.

Gear bearing temperature & Grok Anomaly Score

GROK going to market for anomaly detection in I.T. 2014

Page 36: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

NuPIC: www.Numenta.org

- CLA source code (single tree), GPLv3 - Papers, videos, docs

Community

- 200+ mail list subscribers, growing - 20+ messages per day - full time manager, Matt Taylor

What you can do

- Get educated - New applications for CLA - Extend CLA: robotics, language, vision - Tools, documentation

2nd Hackathon November 2,3 in San Francisco

- Natural language processing using SDRs - Sensory-motor integration discussion - 2014 hackathon Ireland?

2) Open Source Project

Page 37: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

HW companies looking “Beyond von Neumann”

- Distributed memory - Fault tolerant - Hierarchical

New HW Architectures Needed - Speed (research) - Cost, power, embedded (commercical)

IBM - Almaden Research Labs - Joint research agreement

DARPA - New Program called “Cortical Processor” - HTM (Hierarchical Temporal Memory)

- CLA is prototype primitive

Seagate

Sandia Labs

3) Custom CLA Hardware

Page 38: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

Future of Machine Intelligence

Page 39: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

Future of Machine Intelligence

Definite

- Faster, Bigger - Super senses - Fluid robotics - Distributed hierarchy Maybe

- Humanoid robots - Computer/Brain

interfaces for all Not

- Uploaded brains - Evil robots

Page 40: Computing Like the Brain - GOTO Conferencegotocon.com/dl/goto-aar-2013/slides/JeffHawkins_ComputingLikeTh… · Computing Like the Brain The Path To Machine Intelligence Jeff Hawkins

Why Create Intelligent Machines?

Thank You

Live better Learn more