34
Mandy Korpusik, Jim Glass MIT Computer Science and Artificial Intelligence Laboratory Cambridge, MA, USA ICASSP, Brighton, UK May 16, 2019 Dialogue State Tracking with Convolutional Semantic Taggers

Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Mandy Korpusik, Jim GlassMIT Computer Science and Artificial Intelligence Laboratory

Cambridge, MA, USA

ICASSP, Brighton, UKMay 16, 2019

Dialogue State Tracking with Convolutional Semantic

Taggers

Page 2: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Motivation: Spoken Diet Tracking

2

Coco Nutritionist lets you record what you ate with everyday spoken natural language.

Page 3: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

3

Page 4: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Motivation: Nutrition Multi-turn Dialogue

Nutrition question answering– is grilled chicken or red meat better?– What should I eat for dinner?– What is a healthy breakfast– Which cereal is best to keep you satisfied?– How many calories in ## of food item– Is milk healthy? …

Personalized food recommendation(Korpusik et al., CBRecSys, 2016)

4

Page 5: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Nutrition question answering– is grilled chicken or red meat better?– What should I eat for dinner?– What is a healthy breakfast– Which cereal is best to keep you satisfied?– How many calories in ## of food item– Is milk healthy? …

Personalized food recommendation(Korpusik et al., CBRecSys, 2016)

4

Motivation: Nutrition Multi-turn Dialogue

Page 6: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Overview

• Motivation: Nutrition• Introduction• Our work in 3 state tracking challenges:

– DSTC7– DSTC6– DSTC2

• Conclusion

5

Page 7: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Introduction: Spoken Dialogue Systems

Semantic Tagging

Intent Detection Booking a restaurant

RestaurantKnowledge Base

can you book a table for two in bombay in a cheap price range

6

Database Retrieval

Page 8: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Introduction: Spoken Dialogue Systems

7

Our Goal: Develop ability to do dialogue state tracking.

Page 9: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Introduction: Dialogue State Tracking

8

System: any preference on a type of cuisine

Page 10: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Dialogue State Tracking Challenges (DSTC)

• DSTC1 (2013): human-computer bus timetables• DSTC2 and 3 (2014): human-computer restaurant info• DSTC4 (2015): human-human tourist info• DSTC5 (2016): multilingual tourist info• DSTC6 (2017): 3 tracks, end-to-end learning• DSTC7 (2019): 3 tracks (response selection, generation,

and audio-visual)

9

Page 11: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Dialogue State Tracking Challenges (DSTC)

• DSTC1 (2013): human-computer bus timetables• DSTC2 and 3 (2014): human-computer restaurant info• DSTC4 (2015): human-human tourist info• DSTC5 (2016): multilingual tourist info• DSTC6 (2017): 3 tracks, end-to-end learning• DSTC7 (2019): 3 tracks (response selection, generation,

and audio-visual)

9

Page 12: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

DSTC7

Student-Advisor Partial Dialogue:

10

(Korpusik et al., DSTC7 2019)

Page 13: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

DSTC7: Convolutional Neural Encoder

11

(Korpusik et al., DSTC7 2019)

Page 14: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

DSTC6

12

Goal: select the best system response.

(Bordes & Weston, 2016)

Page 15: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Task 1: API Call

13

api_call indian bombay two cheap casual

RestaurantKnowledge Base

Page 16: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Task 2: Updating API Call

14

Page 17: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

DSTC6 Data

15

• 10,000 simulated training dialogues per task• KB of restaurants

– 10 cuisines– 10 locations– 3 price ranges– 4 party sizes

Page 18: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

DSTC6: Related Work

• 2 challenge participants achieved 100% on all tasks:– Extended Hybrid Code Networks for DSTC6 (Ham et al., 2017)

– Modeling Conversations to Learn Responding Policies (Bai et al., 2017)

16

Page 19: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

DSTC6: Related Work

• 2 challenge participants achieved 100% on all tasks:– Extended Hybrid Code Networks for DSTC6 (Ham et al., 2017)– Modeling Conversations to Learn Responding Policies (Bai et

al., 2017)

16

Page 20: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

DSTC6: Our Binary CNN Baseline

17

Page 21: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

DSTC6: Our Full CNN Architecture

18

Approach: 1. Select action template with CNN.2. Populate action template with CNN-predicted semantic tags.

Page 22: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

DSTC6: Our Full CNN Architecture

19

Approach: 1. Select action template with CNN.2. Populate action template with CNN-predicted semantic tags.

Page 23: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Step 1: Semantic Tagging

20

(Korpusik et al., SLT 2014)(Korpusik et al., ICASSP 2016)(Korpusik et al., Trans. ASLP 2017)

Problem: prior state-of-the-art Conditional Random Field (CRF) model requires hand-crafted features.

Solution: use a neural network to automatically learn features during training.

Page 24: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Step 1: Semantic Tagging

21

Page 25: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Step 1: Generating Tagging Data

22

api_call indian bombay two cheap casual…

Page 26: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Step 1: Semantic Tagging Results

23

Page 27: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Step 2: Action Template Selection

24

Page 28: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Step 3: Final Response Generation1) Action mask

api_call: masked out if any slots are still unspecified.request_api_slot: masked out if all slots are specified.

25

2) Use dialogue stateapi_call: populate slots with values in dialogue state.

request_api_slot: select the next slot missing a value.

api_call indian bombay two casualcheap

Page 29: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

DSTC6: Test Results

26

100% precision on both tasks

Page 30: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

From Simulated to Real Data: WOZ 2.0Task: predict all the user’s requested and informableslots at each turn in a restaurant booking dialogue.

27

(Henderson et al., SLT 2014)Neural Belief Tracker: (Mrksic et al., ACL 2017)GLAD: (Zhong et al., ACL 2018)

(Korpusik et al., SLT 2018)

Page 31: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

From Simulated to Real Data: WOZ 2.0

Our Goal: Show our NN models generalize to real dialogues.

28

(Korpusik et al., SLT 2018)

Page 32: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

From Simulated to Real Data: WOZ 2.0

29

(Korpusik et al., SLT 2018)

Separately trained informable slot models.

Page 33: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

From Simulated to Real Data: WOZ 2.0

30

CNN is competitive with state-of-the-art, without requiring semantic dictionaries or pre-trained word vectors.

Page 34: Dialogue State Tracking with Convolutional Semantic Taggerspeople.csail.mit.edu/korpusik/icassp19.pdf · Motivation: Nutrition Multi-turn Dialogue Nutrition question answering –

Conclusion

31

Demonstrated our neural network models’ ability to do dialogue state tracking in several domains.

Future Work:• Experiment on the remaining DSTC6 subtasks.• Jointly train tagger and action selector as end-to-end model.• Automatically learn action mask by adding a feature to action

selector model indicating whether all slots have values.• Apply these techniques to the nutrition domain!