Semi-supervised Dialogue Act Recognition

Maryam Tavafi

Motivation

Detecting human social intentions in spoken conversations

• Dialogue summarization

• Collaborative task learning agents

• Dialogue systems

• ...

Method for Semi-supervised DA modeling

SVM-hmm with bootstrapping
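The slides do not spell out the bootstrapping loop, so the following is only a minimal sketch of generic self-training, under stated assumptions. The function `bootstrap` and its `train`, `predict`, `select` and `rounds` parameters are hypothetical names introduced for illustration; they are not part of SVM-hmm or of the original system.

```python
def bootstrap(labeled, unlabeled, train, predict, select, rounds=3):
    """Generic self-training loop (hypothetical interface, not the SVM-hmm API).

    labeled   : list of (sequence, tags) pairs
    unlabeled : list of sequences without tags
    train     : callable(labeled) -> model
    predict   : callable(model, sequence) -> (tags, confidence_score)
    select    : callable(list of (tags, score)) -> indices of sequences to trust
    """
    labeled = list(labeled)
    pool = list(unlabeled)
    model = train(labeled)
    for _ in range(rounds):
        if not pool:
            break
        # Label every remaining unlabeled sequence with the current model.
        scored = [predict(model, seq) for seq in pool]
        chosen = set(select(scored))
        if not chosen:
            break
        # Move the confidently labeled sequences into the training set.
        labeled += [(pool[i], scored[i][0]) for i in sorted(chosen)]
        pool = [seq for i, seq in enumerate(pool) if i not in chosen]
        model = train(labeled)
    return model, labeled
```

Passing the model and the selection rule as callables keeps the loop independent of the particular classifier and of how confident sequences are chosen.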

The features for the classification are:

• Unigrams in the sentence

• Speaker of the sentence

• Relative position of the sentence in the post

• Length of the sentence, in terms of the number of its
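As a rough illustration of the feature list above, one sentence could be mapped to a sparse feature dictionary as sketched here. The function name `sentence_features`, the feature-key strings and the 0-to-1 scaling of the position are assumptions made for this example. The unit of the length feature is cut off in the slide, so the token count used below is also an assumption.

```python
from collections import Counter


def sentence_features(tokens, speaker, index, post_length):
    """Build a sparse feature dict for one sentence (illustrative sketch).

    tokens      : list of word strings in the sentence
    speaker     : identifier of the sentence's author
    index       : 0-based position of the sentence within its post
    post_length : number of sentences in the post
    """
    # Unigram counts, lower-cased.
    feats = Counter(f"unigram={t.lower()}" for t in tokens)
    # Speaker as a binary indicator feature.
    feats[f"speaker={speaker}"] = 1
    # Relative position in the post, scaled to [0, 1] (assumed scaling).
    feats["rel_position"] = index / max(post_length - 1, 1)
    # Sentence length; measured in tokens here, which is an assumption.
    feats["length"] = len(tokens)
    return dict(feats)


example = sentence_features(["Thanks", "for", "the", "patch"], "alice", 1, 3)
```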

Framework

SVM-hmm

• SVM-hmm classification is based on the Viterbi algorithm

o Viterbi score of a sequence
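To make the notion of a sequence-level Viterbi score concrete, here is a minimal sketch of Viterbi decoding over additive scores. It is a textbook version, not the SVM-hmm implementation: the `emission` and `transition` score tables are assumed inputs, whereas in SVM-hmm such scores would come from the learned weights.

```python
def viterbi(emission, transition):
    """Return (best_tag_sequence, best_score) for additive scores.

    emission[t][y]   : score of tag y at position t
    transition[p][y] : score of moving from tag p to tag y
    """
    n_tags = len(transition)
    score = list(emission[0])
    back = []  # back[t-1][y] = best previous tag for tag y at position t
    for t in range(1, len(emission)):
        prev = score
        score, ptr = [], []
        for y in range(n_tags):
            best_p = max(range(n_tags), key=lambda p: prev[p] + transition[p][y])
            ptr.append(best_p)
            score.append(prev[best_p] + transition[best_p][y] + emission[t][y])
        back.append(ptr)
    # Pick the best final tag, then follow the back-pointers to the start.
    last = max(range(n_tags), key=lambda y: score[y])
    path = [last]
    for ptr in reversed(back):
        path.append(ptr[path[-1]])
    path.reverse()
    return path, score[last]


tags, total = viterbi(
    emission=[[1.0, 0.0], [0.0, 1.0], [0.0, 2.0]],
    transition=[[0.5, 0.0], [0.0, 0.5]],
)
```

The returned total is a sum over all positions, so its magnitude grows with the number of sentences in the sequence.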

Confidence Score

1. Rank all the sequences based on Viterbi score and choose the top X sequences

2. Rank all the sequences based on the Viterbi score normalized by the length of the sequence and choose the top X sequences

3. Sort sequences by their length. Group them into 5 groups, and rank them in each group based on Viterbi score. Choose X sequences from the first group, X-Y from the second, X-2*Y from the third, and so on. (X and Y are the parameters)
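The three selection strategies above can be sketched as follows. Each function takes a list of (tags, Viterbi score) pairs and returns the indices of the selected sequences. The function names are hypothetical, sequences are assumed to be non-empty, and for the third strategy the slides do not say how the groups are formed, so equal-sized groups ordered from shortest to longest are an assumption.

```python
def top_by_score(scored, x):
    """Strategy 1: the x sequences with the highest raw Viterbi score."""
    order = sorted(range(len(scored)), key=lambda i: scored[i][1], reverse=True)
    return order[:x]


def top_by_normalized_score(scored, x):
    """Strategy 2: the x sequences with the highest score per sentence."""
    order = sorted(
        range(len(scored)),
        key=lambda i: scored[i][1] / len(scored[i][0]),
        reverse=True,
    )
    return order[:x]


def top_by_length_bins(scored, x, y, n_groups=5):
    """Strategy 3: group by length, take x, x-y, x-2*y, ... per group.

    Assumes equal-sized groups ordered from shortest to longest sequences.
    """
    by_length = sorted(range(len(scored)), key=lambda i: len(scored[i][0]))
    size = -(-len(by_length) // n_groups)  # ceiling division
    chosen = []
    for g in range(n_groups):
        group = by_length[g * size:(g + 1) * size]
        group.sort(key=lambda i: scored[i][1], reverse=True)
        chosen += group[:max(x - g * y, 0)]  # never take a negative count
    return chosen
```

The second and third strategies both address the same issue: a raw Viterbi score is a sum over the sentences of a sequence, so sequences of different lengths are not directly comparable.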

Corpora: Asynchronous Conversations

• Email

o Labeled dataset: BC3

o Unlabeled dataset: W3C

o Tagset: 12 DAs

• Forum

o Labeled dataset: CNET

o Unlabeled dataset: BC3 Blog

o Tagset: 11 DAs

Corpora: Synchronous Conversations

• Meeting

o MRDA

o Tagset: 11 DAs

• Phone

o SWBD

o Tagset: 16 DAs

Results

Supervised with SVM-hmm (Baseline is majority class)

Results

Semi-supervised on Email (comparison of choosing top examples)

Results

• SWBD

o no significant improvement

o small dataset

• MRDA

o small improvement using the binning approach

• CNET

o no significant improvement

o thread structure of the unlabeled data was not available

Lessons learned

• Email conversations benefit the most from adding unlabeled data

• When using the Viterbi score as a confidence score for SVM-hmm, we should consider the length difference between sequences

o normalize the score by the length

Evaluation

• Showed SVM-hmm performs well for DA modeling on different domains

• Bootstrapping performed better on the email dataset

o We need a large unlabeled dataset for DA modeling

Future Work

• Other semi-supervised techniques

• Parameters for the confidence score

• Additional features

o Bigrams, trigrams, POS tags, prosodic features for meeting and phone

Questions?
