1
[email protected], [email protected] Orange Labs, 2 avenue Pierre Marzin, 22307 Lannion, France We thank Telecom Bretagne, Q. Jin, X. Chen, S.Zarrad, F. Agez and A. Bolze for their contribution to the platform deployment. Ergonomics Expert Marketing Experts Technical Developper Hosting Provider Visual dashboards display pre-processed data according to pre-defined evaluation profiles. Ex: Dashboard dedicated to a high-level view on usability performances A summary of high-level KPIs provides a general view on the evaluated system with ”red-light indicators”. Links to more detailed charts or analysis tools are displayed next to each of them.

Sigdial poster mpowers_final

Embed Size (px)

Citation preview

Page 1: Sigdial poster mpowers_final

[email protected], [email protected] Labs, 2 avenue Pierre Marzin, 22307 Lannion, France

We thank Telecom Bretagne, Q. Jin, X. Chen, S.Zarrad, F. Agez and A. Bolze for their contribution to the platform deployment.

ErgonomicsExpert Marketing

ExpertsTechnical

Developpe

r

HostingProvider

Visual dashboards display pre-processed data according to pre-defined evaluation profiles. Ex: Dashboard dedicated to a

high-level view on usability performances

A summary of high-level KPIs provides a general view on the evaluated system with ”red-light indicators”. Links to more detailed charts or analysis

tools are displayed next to each of them.

Features‐Implementsdesignconven�onsinauser‐friendlyinterface

‐Ensuresallevaluator‐usersmanipulateauniquesharedcorpusofdata

‐Asharedsetofparametersforevalua�onsdesignandretrieval

‐Supportstheconvergenceoftheevalua�onprac�ces

‐Considersseveralanaly�calpointsofview

‐Supportsabothdata‐drivenandgoal‐drivenprocess

‐Business‐intelligencesolu�onbased

Context‐Noonesizefits‐all‐evalua�ontocoverallneeds

‐Nomadismofevalua�onapproaches

‐Needforcoopera�onbetweenstakeholders

MPOWERSaMultiPointofvieWEvaluationRefinementStudioforSpokenDialogueSystems

Evaluation"Processthroughwhichonedefines,obtainsanddeliversusefulpiecesofinforma�ontose�lebetween

thealterna�vepossibledecisions."DanielStufflebeam,1980.

.

A uniquecorpusof data for all

Ambitions‐Alignmentofevalua�onapproaches(notstandardiza�on)‐Enforcementofnormstodescribetheevalua�onprotocols,commonthinkingmodelsandterminologyforevaluatorstomaketheirproceduresexplicit

Screenshots

commonlydefined KPIs(ITU-T Rec.

sup.24)

Custom-made

dashboards

Let's Go!system corpus

(CMU)