
Compositional generalization in artificial neural networks and humans

Marco Baroni

Facebook AI Research

Outline

• Recurrent neural networks

• A compositional challenge for recurrent neural networks

• How do humans dax twice?


Brenden Lake (FAIR, NYU)

João Loula (FAIR, MIT)

Tal Linzen (JHU)

Artificial neural networks

[Diagram: an input (e.g., an image) is mapped by the network to an output label, "Yorkshire terrier"]

"Training" consists in optimally setting network weights to produce the right output for each example input.

The generality of neural networks

• I: images, O: object labels
• I: documents, O: topics
• I: pictures of cars, O: voting preferences ...

training agnostic to nature of input and output

Taking time into account with recurrent connections

[Diagram: the output is computed from the external input together with the state of the network at the previous time step]

Recurrent neural networks: the "unfolded" view

[Diagram: the RNN unfolded over time, with inputs i1 ... i6, outputs o1 ... o6, and recurrent connections linking consecutive time steps]

Modern RNNs (e.g., LSTMs) possess gating mechanisms that improve temporal information flow
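As a minimal illustration (not from the talk), the Python sketch below runs PyTorch's gated LSTM over a toy six-step input sequence; all sizes are arbitrary placeholders:

    # Minimal sketch: a gated RNN (LSTM) processing a toy sequence in PyTorch.
    # Sizes are arbitrary placeholders, not values from the talk.
    import torch
    import torch.nn as nn

    seq_len, batch, input_size, hidden_size = 6, 1, 10, 20

    lstm = nn.LSTM(input_size, hidden_size)           # input, forget and output gates regulate information flow
    inputs = torch.randn(seq_len, batch, input_size)  # i1 ... i6 from the unfolded view

    outputs, (h_n, c_n) = lstm(inputs)                # one output per time step, plus final hidden/cell state
    print(outputs.shape)                              # torch.Size([6, 1, 20])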

The generality of recurrent neural networks

• I: English sentences, O: French sentences
• I: linguistic instructions, O: action sequences
• I: video game states, O: next actions ...

Are we on the verge of general machine intelligence?


Lake et al. 2018

When are humans fast at learning?

• When evolution has done the slow learning work for us
  • Perception and categorization, naïve physics and psychology, motor skills, core language faculties, reasoning...

• When new problems can be solved by combining old tricks (compositionality)

Compositional reasoning in 4-year-olds

Piantadosi and Aslin 2016

Screen combination seen in test phase only

Outline

• Recurrent neural networks

• A compositional challenge for recurrent neural networks

• How do humans dax twice?


• Brenden Lake and Marco Baroni. Generalization without systematicity: On the compositional skills of sequence-to-sequence recurrent networks. ICML 2018

• The SCAN challenge: https://github.com/brendenlake/SCAN/

There is lots of earlier work on neural networks and compositionality; the main novelty here is that we test latest-generation, state-of-the-art architectures!
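As a small illustration of working with the dataset, here is a Python sketch that parses SCAN-style example files; it assumes the "IN: <command> OUT: <actions>" line format, and the file path shown is hypothetical, so check the repository for the exact layout:

    # Sketch: parse SCAN-style examples into (command, action-sequence) pairs.
    # Assumes lines like "IN: jump twice OUT: JUMP JUMP"; verify against the repo.
    def load_scan(path):
        pairs = []
        with open(path) as f:
            for line in f:
                line = line.strip()
                if not line.startswith("IN:"):
                    continue
                command, actions = line[len("IN:"):].split("OUT:")
                pairs.append((command.split(), actions.split()))
        return pairs

    # Hypothetical usage:
    # data = load_scan("SCAN/simple_split/tasks_train_simple.txt")
    # print(data[0])  # e.g. (['jump'], ['JUMP'])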

Systematic compositionality
Fodor and Pylyshyn 1988, Marcus 2003, 2018...

• Walk
• Walk twice
• Run
• Run twice
• Dax
• Dax twice

[[X twice]] = [[X]] [[X]]
[[dax]] = perform the daxing action

... or perhaps meanings include algorithmic components such as: for (c = 0; c < 3; c++) { perform X }
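A tiny Python sketch (my illustration, not from the talk) of such an interpretation function, with a simplified vocabulary and one-modifier command syntax:

    # Sketch of a compositional interpreter: [[X twice]] = [[X]] [[X]], etc.
    # The vocabulary and command syntax are simplified placeholders.
    PRIMITIVES = {"walk": ["WALK"], "run": ["RUN"], "jump": ["JUMP"], "dax": ["DAX"]}
    REPEATS = {"twice": 2, "thrice": 3}

    def interpret(command):
        """Map a command like 'dax twice' to its action sequence."""
        tokens = command.split()
        actions = list(PRIMITIVES[tokens[0]])
        if len(tokens) > 1 and tokens[1] in REPEATS:
            actions = actions * REPEATS[tokens[1]]  # [[X twice]] = [[X]] [[X]]
        return actions

    print(interpret("dax twice"))   # ['DAX', 'DAX']
    print(interpret("run thrice"))  # ['RUN', 'RUN', 'RUN']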

Systematic compositionality in a simple grounded environment

walk and turn left! → WALK LTURN

Testing generalization

TRAINING PHASE:
• walk → WALK
• jump after walk → WALK JUMP
• run thrice → RUN RUN RUN
• walk and run → WALK RUN

TEST TIME:
• walk and jump left → WALK LTURN JUMP
• look right and walk left → RTURN LOOK LTURN WALK
• run around right → RTURN RUN RTURN RUN RTURN RUN RTURN RUN
• jump around and run

Sequence-to-sequence RNNs for SCAN

[Diagram: an encoder RNN reads the command "jump twice and walk <EOS>"; a decoder RNN, primed with <SOS>, outputs the action sequence "JUMP JUMP WALK <EOS>"]

General methodology

• Train sequence-to-sequence RNN on 100k commands and corresponding action sequences

• At test time, only new composed commands presented

• Each test command presented once

• RNN must generate the right action sequence on the first try

• Training details: ADAM optimization with 0.001 learning rate and 50% teacher forcing

• Best model overall: 2-layer LSTM with 200 hidden units per layer, no attention, 0.5 dropout (see the sketch below)
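To make the setup concrete, here is a heavily simplified PyTorch sketch of a 2-layer LSTM encoder-decoder trained with Adam (learning rate 0.001) and 50% teacher forcing; the vocabularies, data, and embedding size are toy placeholders, and this is a sketch of the general recipe rather than the authors' actual code:

    # Simplified seq2seq sketch for SCAN-like data (not the authors' code).
    # Hyperparameters follow the slide: 2 layers, 200 hidden units, 0.5 dropout,
    # Adam with lr 0.001, 50% teacher forcing. Vocabularies and data are toy placeholders.
    import random
    import torch
    import torch.nn as nn

    IN_VOCAB = {"<pad>": 0, "jump": 1, "walk": 2, "twice": 3, "and": 4, "<eos>": 5}
    OUT_VOCAB = {"<pad>": 0, "<sos>": 1, "<eos>": 2, "JUMP": 3, "WALK": 4}
    HIDDEN, LAYERS, DROPOUT, EMB = 200, 2, 0.5, 200

    class Seq2Seq(nn.Module):
        def __init__(self):
            super().__init__()
            self.src_emb = nn.Embedding(len(IN_VOCAB), EMB)
            self.tgt_emb = nn.Embedding(len(OUT_VOCAB), EMB)
            self.encoder = nn.LSTM(EMB, HIDDEN, LAYERS, dropout=DROPOUT)
            self.decoder = nn.LSTM(EMB, HIDDEN, LAYERS, dropout=DROPOUT)
            self.out = nn.Linear(HIDDEN, len(OUT_VOCAB))

        def forward(self, src, tgt, teacher_forcing=0.5):
            # Encode the command; the final hidden state initializes the decoder.
            _, state = self.encoder(self.src_emb(src))
            token = tgt[0:1]                      # <sos>
            logits = []
            for t in range(1, tgt.size(0)):
                step, state = self.decoder(self.tgt_emb(token), state)
                logit = self.out(step)
                logits.append(logit)
                # 50% of the time feed the gold token, otherwise the model's own prediction.
                if random.random() < teacher_forcing:
                    token = tgt[t:t + 1]
                else:
                    token = logit.argmax(-1)
            return torch.cat(logits)              # (tgt_len - 1, batch, out_vocab)

    # Toy example: "jump twice and walk <eos>" -> "<sos> JUMP JUMP WALK <eos>"
    src = torch.tensor([[1], [3], [4], [2], [5]])  # (src_len, batch=1)
    tgt = torch.tensor([[1], [3], [3], [4], [2]])  # (tgt_len, batch=1)

    model = Seq2Seq()
    optimizer = torch.optim.Adam(model.parameters(), lr=0.001)
    loss_fn = nn.CrossEntropyLoss()

    logits = model(src, tgt)
    loss = loss_fn(logits.view(-1, len(OUT_VOCAB)), tgt[1:].view(-1))
    loss.backward()
    optimizer.step()
    print(float(loss))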

Experiment 1: random train/test split

• Included in training tasks:
  • look around left twice
  • look around left twice and turn left
  • jump right twice
  • run twice and jump right twice

• Presented during testing:
  • look around left twice and jump right twice

Random train/test split results


Experiment 2: split by action length

• Train on commands requiring shorter action sequences (up to 22 actions):
  • jump around left twice (16 actions)
  • walk opposite right thrice (9 actions)
  • jump around left twice and walk opposite right twice (22 actions)

• Test on commands requiring longer action sequences (from 24 to 48 actions):
  • jump around left twice and walk opposite right thrice (25 actions)

A grammar must reflect and explain the ability of a speaker to produce and understand new sentences which may be longer than any he has previously heard (Chomsky 1956)

Length split results


Experiment 3: generalizing composition of a primitive command (the "dax" experiment)

• Training set contains all possible commands with "run", "walk", "look", "turn left", "turn right":
  • "run", "run twice", "turn left and run opposite thrice", "walk after run", ...

• ... but only a small set of composed "jump" commands:
  • "jump", "jump left", "run and jump", "jump around twice"

• System tested on all remaining "jump" commands (such a split can be constructed as sketched below):
  • jump twice
  • jump left and run opposite thrice
  • walk after jump
  • ...
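For illustration only, the snippet below shows one way such an add-primitive split could be constructed from command/action pairs; the toy examples and the allowed_jump set are placeholders, not the exact split used in the paper:

    # Sketch: build an "add-primitive" split by holding out composed "jump" commands.
    # `examples` stands in for the full set of SCAN command/action pairs.
    examples = [
        ("run", "RUN"),
        ("run twice", "RUN RUN"),
        ("walk after run", "RUN WALK"),
        ("jump", "JUMP"),
        ("jump twice", "JUMP JUMP"),
        ("jump left and run", "LTURN JUMP RUN"),
    ]

    # Keep the bare primitive "jump" (plus, in some variants, a few composed forms)
    # in training; every other command containing "jump" goes to the test set.
    allowed_jump = {"jump"}

    train = [(c, a) for c, a in examples
             if "jump" not in c.split() or c in allowed_jump]
    test = [(c, a) for c, a in examples
            if "jump" in c.split() and c not in allowed_jump]

    print(len(train), len(test))  # 4 2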


Composed-"jump" split results


RNN correctly executes "jump and run opposite right" but not, e.g., "jump and run"

Experiment 4: generalizing the composition of familiar modifiers

• Training set includes all commands except those containing the "around right" combination:
  • "run", "run around left", "jump right and run around left thrice", "walk right after jump left", ...

• System tested on "around right" commands:
  • run around right
  • jump left and walk around right
  • ...

• Also less challenging splits in which all "X around right" commands are added to the training set for 1, 2, or 3 distinct fillers (verbs)

João Loula, Marco Baroni and Brenden Lake. Rearranging the familiar: Testing compositional generalization in recurrent networks. BlackboxNLP workshop at EMNLP 2018.

"Around right"-split results


Seq2seq models: conclusions

• State-of-the-art "Seq2Seq" recurrent neural networks achieve a considerable degree of generalization (Exp 1)...

• ... but this generalization does not appear to be "systematically compositional" in the Fodorian sense (Exps 2-4)


Outline

• Recurrent neural networks

• A compositional challenge for recurrent neural networks

• How do humans dax twice?


[Figure: nonce-word stimuli]

TRAINING: dax, zup, blicket, tufa, zup wif blicket, blicket wif dax

TEST: dax wif tufa

Lessons learned

• Confirmed that humans are fast learners, up to the ability to combine two functional elements zero-shot

• However, they need full access to the training set while solving the task and an incremental curriculum, and performance is not at 100%

• Systematic biases emerge in error patterns

[Results: 85% accuracy (N=25); 76% accuracy (N=20)]

Systematic biases in errors

[Figure: for the test item "dax wif tufa" (training as above), participant responses classified as EXPECTED, ICONIC CONCATENATION, or ONE-TO-ONE]

"Blank state" experiments (subjects N = 29)

[Figure: a pool of response symbols and nonce-word stimuli built from "fep", "zup", "wif", "dax", "fap", "kiki" (e.g., "fep fep fep")]

fep dax fap kiki dax fepfep dax kiki

(Consistent) iconic concatenation (79.3% of participants)

One-to-one mapping (62.1% of participants)

Mutual exclusivity (95.7% of consistent participants)

58.6% of participants used words consistently and respected all biases

Human compositional skills: conclusions

• Humans are much faster at generalizing (although they are not perfect composers either)
• They display some consistent biases in generalization
• Are human biases useful for fast learning?
• Can we get neural networks to display the same biases?

Thank you! Grazie mille!