Utterance classification with discriminative language modeling
Murat Saraclar, Brian Roark
Speech Communication, 2006
Bang-Xuan Huang
Department of Computer Science & Information Engineering
National Taiwan Normal University
Outline
• Introduction
• Method
• Experimental results
• Discussion
Introduction
• Reductions in WER are critical for applications making use of automatic speech recognition (ASR), but the key objective of a particular application may be different.
• In this paper, we perform a range of experiments investigating the benefit of simultaneous optimization of parameters for both WER and class-error rate (CER) reductions.
• We demonstrate that simultaneously optimizing parameter weights for both n-gram and class features provides significant reductions in both WER and CER.
Method
Feature definitions
• The feature set investigated in the current paper includes:
  (1) the scaled cost given by the baseline ASR system, i.e. $\log P(A, W)$;
  (2) unigram, bigram and trigram counts in the utterance, e.g. C(w1, w2, w3);
  (3) the utterance class cl; and
  (4) class-specific unigram and bigram counts, e.g. C(cl, w1, w2).
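As an illustration of these four feature types, the sketch below builds one sparse feature vector for a single hypothesis; the function name, the feature keys, and the (words, class, baseline log-score) interface are our assumptions, not the paper's.

```python
from collections import Counter

def extract_features(words, cl, log_p_aw, scale=1.0):
    # One sparse feature vector for a hypothesis (cl, words) whose
    # baseline log-score is log P(A, W).  Feature keys are illustrative.
    feats = Counter()
    feats["asr_cost"] = scale * log_p_aw          # (1) scaled baseline cost
    for n in (1, 2, 3):                           # (2) n-gram counts
        for i in range(len(words) - n + 1):
            feats[("ngram", tuple(words[i:i + n]))] += 1
    feats[("class", cl)] += 1                     # (3) the utterance class
    for n in (1, 2):                              # (4) class-specific n-grams
        for i in range(len(words) - n + 1):
            feats[("class_ngram", cl, tuple(words[i:i + n]))] += 1
    return feats
```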
Method
• A separate n-gram model would be
estimated for each class, and the
initial class-labeled arc would have
a prior probability of the class, P(C). • The class (W) is then selected
for word string W that maximizes
the joint probability of class and
word string W, i.e.
8
C
),(maxarg)( WCPWCc
)|()(maxarg CWPCPc
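As a minimal sketch of this decision rule (names are illustrative, not from the paper), assuming each class has a prior probability and a function returning the class-conditional log-probability of a word string, e.g. a class-specific n-gram model:

```python
import math

def classify(words, class_prior, class_lm):
    # Choose the class maximizing log P(C) + log P(W | C), reading the
    # argmax above directly; class_lm[c] returns log P(W | C=c).
    return max(class_prior,
               key=lambda c: math.log(class_prior[c]) + class_lm[c](words))
```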
• We can control parameter updates to suit a particular objective by choosing the relevant features, or by choosing the gold-standard annotation so that only features related to that objective are updated.
• Here we have two main objectives: having an accurate transcription and choosing the correct class for each utterance.
• The parameters can be independently or simultaneously optimized to achieve either one or both of the objectives.
• Gd: the correct class and minimum-error-rate word string for the utterance x
• 1B: the one-best annotation from $L_c \circ N_c \circ l$ under the current models
• c(Gd): the class label of the annotation Gd
• w(Gd): the word transcript of the annotation Gd
objective           | classification error reduction | word-error rate reduction
ignore              | transcription                  | class label
target annotation y | c(Gd) w(1B)                    | c(1B) w(Gd)
feature sets        | (3) and (4)                    | (1), (2), (4)
note                | all bigram counts come only from the best-scoring transcription | class labels come only from the best-scoring hypothesis
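Read as pseudocode, the table amounts to mixing gold-standard (Gd) and one-best (1B) fields when assembling the training target y. A hedged sketch follows; the function name is ours, and the joint case is our assumption for the simultaneous-optimization setting.

```python
def make_target(objective, c_gd, w_gd, c_1b, w_1b):
    # Build the training target y per the table above.
    if objective == "classification":   # ignore the transcription
        return c_gd, w_1b               # y = c(Gd) w(1B)
    if objective == "transcription":    # ignore the class label
        return c_1b, w_gd               # y = c(1B) w(Gd)
    return c_gd, w_gd                   # joint (assumed): y = c(Gd) w(Gd)
```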
Perceptron
The perceptron can be viewed as optimizing

  $F_{Perc}(\bar\alpha) = \frac{1}{2} \sum_{i=1}^{L} \left( \bar\alpha \cdot \left( \Phi(x_i, y_i) - \Phi(x_i, z_i) \right) \right)^2$

Taking the partial derivative with respect to $\bar\alpha$ yields terms in $\Phi(x_i, y_i) - \Phi(x_i, z_i)$, the per-example perceptron update.
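A sketch of the resulting training loop under the standard structured-perceptron update $\bar\alpha \leftarrow \bar\alpha + \Phi(x_i, y_i) - \Phi(x_i, z_i)$; the `decode` and `feats` helpers (one-best decoding under the current weights, and feature extraction as sketched earlier) are assumed, not taken from the paper.

```python
from collections import defaultdict

def perceptron_train(examples, decode, feats, epochs=5):
    # Structured-perceptron sketch: when the current one-best z
    # disagrees with the gold annotation y, move the weights toward
    # Phi(x, y) and away from Phi(x, z).
    alpha = defaultdict(float)
    for _ in range(epochs):
        for x, y in examples:
            z = decode(x, alpha)        # one-best under current weights
            if z != y:
                for f, v in feats(x, y).items():
                    alpha[f] += v
                for f, v in feats(x, z).items():
                    alpha[f] -= v
    return alpha
```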
GCLM
The global conditional log-linear model (GCLM) maximizes the conditional log-likelihood of the training data:

  $F_{GCLM}(\bar\alpha) = \log \prod_{i=1}^{L} P(y_i \mid x_i) = \sum_{i=1}^{L} \log \frac{\exp(\bar\alpha \cdot \Phi(x_i, y_i))}{\sum_{j=1}^{M_i} \exp(\bar\alpha \cdot \Phi(x_i, y_{i,j}))}$   (1)

Adding a zero-mean Gaussian regularizer on the parameters gives

  $F_{GCLM}(\bar\alpha) = \sum_{i=1}^{L} \log \frac{\exp(\bar\alpha \cdot \Phi(x_i, y_i))}{\sum_{j=1}^{M_i} \exp(\bar\alpha \cdot \Phi(x_i, y_{i,j}))} - \frac{\|\bar\alpha\|^2}{2\sigma^2}$   (2)

Taking the partial derivative with respect to $\bar\alpha$, the gradient is the observed (gold) feature counts minus their expectation under the model, minus the regularization term:

  $\frac{\partial F_{GCLM}}{\partial \bar\alpha} = \sum_{i=1}^{L} \left( \Phi(x_i, y_i) - \sum_{j=1}^{M_i} \frac{\exp(\bar\alpha \cdot \Phi(x_i, y_{i,j}))}{\sum_{k=1}^{M_i} \exp(\bar\alpha \cdot \Phi(x_i, y_{i,k}))}\, \Phi(x_i, y_{i,j}) \right) - \frac{\bar\alpha}{\sigma^2}$
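A sketch of one gradient evaluation for objective (2); the `feats` and `cands` helpers (feature extraction and the M_i candidate annotations per utterance) are assumptions, and a shifted softmax is used for numerical stability.

```python
import math
from collections import defaultdict

def gclm_gradient(examples, alpha, feats, cands, sigma2=1.0):
    # Gradient of objective (2): gold feature counts minus their
    # expectation under the model, minus the Gaussian-prior term.
    grad = defaultdict(float)
    for x, y in examples:
        ys = cands(x)                                  # the M_i candidates
        scores = [sum(alpha.get(f, 0.0) * v for f, v in feats(x, yj).items())
                  for yj in ys]
        m = max(scores)                                # shift for stability
        probs = [math.exp(s - m) for s in scores]
        z = sum(probs)
        probs = [p / z for p in probs]                 # P(y_{i,j} | x_i)
        for f, v in feats(x, y).items():               # observed counts
            grad[f] += v
        for yj, p in zip(ys, probs):                   # expected counts
            for f, v in feats(x, yj).items():
                grad[f] -= p * v
    for f, a in alpha.items():                         # -alpha / sigma^2
        grad[f] -= a / sigma2
    return grad
```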
Experimental results
• Baseline results
Experimental results
• Perceptron
Experimental results
• GCLM
Experimental results
• This paper has investigated the effect of joint discriminative modeling on two objectives, classification and transcription error.
• On the classification side, there are three potential factors leading to the best performance:
  (1) improvements in word accuracy provided by the model;
  (2) delaying the extraction of the 1-best hypothesis, i.e. using word-lattice input rather than strings; and
  (3) simultaneously optimizing parameters for classification and transcription error reduction.
• In the multi-label scenario, the first two factors each provide a 1% reduction independently and a 1.6% total reduction when combined.
• Simultaneous optimization does not provide further improvement over the first two factors.
Experimental results
• Simultaneous optimization of the parameters gives as much improvement in word accuracy as could be obtained knowing the true class of the utterance, without penalizing TCErr.
• Using global conditional log-linear models further improved the performance.
• Independently optimizing the parameters yielded a 0.5% improvement in word accuracy and a 0.8% improvement in classification error over the perceptron algorithm.