White Paper Phonetic Search Tech

Embed Size (px)

Citation preview

  • 7/30/2019 White Paper Phonetic Search Tech

    1/17

    2008 Nxidia Inc. All igs svd. All admaks a i sciv ns. Nxidia ducs a cd b cigs

    and n m lling Unid Sas ans: 7,231,351; 7,263,484; 7,313,521; 7,324,939; 7,406,415 and ans nding.

    whIte pAper

    pnic Sac tcnlg

    A Whitepaper by Nexidia, Inc.

  • 7/30/2019 White Paper Phonetic Search Tech

    2/17

    2008 Nxidia Inc. All igs svd. All admaks a i sciv ns. Nxidia ducs a cd b cigs

    and n m lling Unid Sas ans: 7,231,351; 7,263,484; 7,313,521; 7,324,939; 7,406,415 and ans nding. 2

    whItepAper

    pnic Sac tcnlg

    Cig Nic

    Copyright 2004-2009, Nexidia Inc. All rights reserved.

    This manual and any sotware described herein, in whole or in part may not

    be reproduced, translated or modifed in any manner, without the prior written

    approval o Nexidia Inc. Any documentation that is made available by Nexidia Inc.

    is the copyrighted work o Nexidia Inc. or its licensors and is owned by Nexidia Inc.

    or its licensors. This document contains inormation that may be protected by one

    or more U.S. patents, oreign patents or pending applications.

    trADeMArKS

    Nexidia, Enterprise Speech Intelligence, Nexidia ESI, the Nexidia logo, and

    combinations thereo are trademarks o Nexidia Inc. in the United States and other

    countries. Other product name and brands mentioned in this manual may be the

    trademarks or registered trademarks o their respective companies and are hereby

    acknowledged.

  • 7/30/2019 White Paper Phonetic Search Tech

    3/17

    2008 Nxidia Inc. All igs svd. All admaks a i sciv ns. Nxidia ducs a cd b cigs

    and n m lling Unid Sas ans: 7,231,351; 7,263,484; 7,313,521; 7,324,939; 7,406,415 and ans nding. 3

    whItepAper

    pnic Sac tcnlg

    Cnsqunl, muc is daa as unavailabl analsis, suc as millins us call cn calls cdd v

    a a a acivd lgal asns. Using a m adiinal aac, a v small amun audi ma b lisnd ,

    bu in an ad-c mann, suc as andm audis b call cn manags, lisning vaius badcass. tagd sacing,v, is dicul. I is audi daa asil sacabl, man alicains uld b ssibl, suc as: viing nl

    calls a m s ciia, ming nd analsis acss usands us cusm calls, sacing an ni nscas

    nd xac lcain a cain ic is discussd and man uss.

    t dicul in accssing inmain in ms audi da is a unlik sm badcas mdia, clsd caining is n

    availabl. Fu, man-mad anscis a xnsiv gna, and limid in i dsciin. Audi sac basd n

    sc--x cnlg is n scalabl, dnds n igl aind dicinais and gnas a ibiiv al cs

    nsi. wa is ndd is an alna aac.

    In is a, summaiz i k in sacing audi and xamin caacisics vaius mds. w n

    induc and dscib a b aac knn as nic-basd sac, dvld b sac gu a Nxidia in

    cnjuncin i Ggia Insiu tcnlg. pnic-basd sac is dsignd xml as sacing ug

    vas amuns mdia, alling sac ds, ass, jagn, slang and ds n adil und in a sc--x

    dicina. Bl vid a dsciin Nxidias cnlg and n discuss accuac nic sac and

    nall sn cun alicains cnlg isl.

    Cnac cns/nis, ic mdia, lgal/audi discv and gvnmn alicains a aas Nxidia as bn

    succssull alid.

    pi wk in Audi Sac

    rival inmain m audi and sc as bn a gal man sacs v as n as. t simls

    sluin is blm uld b us Lag Vcabula Cninuus Sc rcgniin (LVCSr), m im alignmn,

    and duc an indx x cnn alng i im sams. LVCSr is sucinl mau a lbxs a ublicl availabl

    suc as htK (m Cambidg Univsi, england), ISIp (Mississii Sa Univsi, USA), and Sinx (Cangi Mlln

    Univsi, USA) as ll as a s cmmcial ings. Muc imvd manc dmnsad in cun LVCSr

    ssms cms m b linguisic mdling [Juask] limina squncs ds a a n alld iin

    languag. Ununal, d as a sldm z.

    Inducin

    From call centers to broadcast news programs, the quantity of digital les being created is

    growing quickly and shows no signs of slowing. While valuable information may exist in the

    audio of these les, there has historically been no effective means to organize, search and

    analyze the data in an efcient manner.

  • 7/30/2019 White Paper Phonetic Search Tech

    4/17

    2008 Nxidia Inc. All igs svd. All admaks a i sciv ns. Nxidia ducs a cd b cigs

    and n m lling Unid Sas ans: 7,231,351; 7,263,484; 7,313,521; 7,324,939; 7,406,415 and ans nding. 4

    whItepAper

    pnic Sac tcnlg

    t nd b aumaic ival audi daa as md mulain daabass scicall s is caabili

    [Ga]. Als, a saa ack as bn sablisd skn dcumn ival iin annual treC (tx rival

    Cnnc) vn [Gal]. An xaml can b sn in [Jnsn]. In is sac, a ansciin m LVCSr as ducd

    n NISt hub-4 Badcas Ns cus. wl snnc quis a sd, and ansciin is sacd using inllign

    x-basd inmain xacin mds. Sm insing daa m is ss a d as ang m 64%

    20%, dnding n LVCSr ssm usd, and clsd caining as a ugl 12%. wil sc cgniin as

    imvd sinc s suls, imvmn as bn incmnal.

    Ng and Zu [Ng] cgnizd nd nic sacing b using subd unis inmain ival. Alug

    nic as ig (37%) and manc ival ask as l cmad LVCSr mds, ga mis

    as aniciad b aus.

    In LVCSr aac, cgniz is anscib all inu sc as a cain ds in is vcabula. Kd

    sing is a din cniqu sacing audi scic ds and ass. In is aac, cgniz is nl

    cncnd i ccuncs n kd as. Sinc sc singl d mus b cmud (insad

    ni vcabula), muc lss cmuain is quid. tis as v iman al al-im alicains suc as

    suvillanc and aumain a-assisd calls [wiln] [wld].

    An advanag kd sing is nial an n vcabula a sac im, making is cniqu usul in

    aciv ival. tis cniqu, v, is inadqua al-im xcuin. wn sacing ug ns undds

    INDUStry

    Contact Centers/Enterprise

    BeNeFItS FroM phoNetIC SeArCh

    > Improved customer interactions

    > Deeper business intelligence

    > Operational efciencies

    Rich Media > Large amounts o long orm content is searchable

    > Automated categorization and fltering

    > Synchronize stories with videos

    > Ad targeting

    > Easily monetized content

    Legal/Audio Discovery > Corporate compliance

    > Litigation support

    > Fast and accurate audio discovery

    Government > Audio search

    > Public saety

    > Standards compliance

  • 7/30/2019 White Paper Phonetic Search Tech

    5/17

    2008 Nxidia Inc. All igs svd. All admaks a i sciv ns. Nxidia ducs a cd b cigs

    and n m lling Unid Sas ans: 7,231,351; 7,263,484; 7,313,521; 7,324,939; 7,406,415 and ans nding. 5

    whItepAper

    pnic Sac tcnlg

    Figure 1

    Nexidia High-Speed

    Phonetic Search Architecture

    usands us acivd audi daa, scanning mus b xcud man usands ims as an al-im.

    t aciv is gal, a n class kd ss as bn dvld a ms saa indxing and sacing

    sags. In ding s, sac sds a a sval usand ims as an al im av bn succssull acivd.

    t dminan aacs av bn as-indxing mds [Saukkai] and nic laic mds [Jams] and

    cmbinains [yu]. In s aac, sd is acivd b gnaing a dsciin sc signal using

    a subs sub-d dscis. ts dscis a usd na sac sac a ival im. In scnd

    aac, sc is indxd duc a laic likl nms a can b sacd quickl an givn nm

    squnc. In all s mds, accuac as bn sacicd sd.

    t Nxidia hig-Sd pnic Sac engin

    w n induc an aac nic sacing, illusad in Figu 1. tis ig-sd algim [Clmns

    al. 2001a; Clmns al. 2001b; Clmns al. 2007; U.S. ans 7,231,351; 7,263,484; 7,313,521; 7,324,939;

    7,406,415] cmiss assindxing and sacing. t s as indxs inu sc duc a nic

    sac ack and is md nl nc. t scnd as, md nv a sac is ndd a d as,

    is sacing nic sac ack. onc indxing is cmld, is sac sag can b ad an numb

    quis. Sinc sac is nic, sac quis d n nd b in an -dnd dicina, us alling sacs

    nams, n ds, misslld ds, jagn c. N a nc indxing as bn cmld, iginal mdia a

    n invlvd a all duing sacing and sac ack culd b gnad n igs-quali mdia availabl imvd

    accuac ( xaml: -la audi ln), bu n audi culd b lacd b a cmssd snain

    sag and subsqun laback ( xaml: GSM) aads.

  • 7/30/2019 White Paper Phonetic Search Tech

    6/17

    2008 Nxidia Inc. All igs svd. All admaks a i sciv ns. Nxidia ducs a cd b cigs

    and n m lling Unid Sas ans: 7,231,351; 7,263,484; 7,313,521; 7,324,939; 7,406,415 and ans nding. 6

    whItepAper

    pnic Sac tcnlg

    Indxing and acusic mdl

    t indxing as bgins i ma cnvsin inu mdia (s ma mig b Mp3, ADpCM, Quicktim, c.)

    in a sandad audi snain subsqun andling (pCM). tn, using an acusic mdl, indxing ngin scans

    inu sc and ducs csnding nic sac ack. An acusic mdl jinl sns caacisics

    b an acusic cannl (an nvinmn in ic sc as ud and a ansduc ug ic i as cdd)

    and a naual languag (in ic uman bings xssd inu sc). Audi cannl caacisics includ: qunc

    sns, backgund nis and vbain. Caacisics a naual languag includ gnd, dialc and accn

    sak.

    Nxidia icall ducs acusic mdls ac languag:

    a mdl mdia i ig samling as, gd signal--nis ais, and m mal, asd sc; and

    a mdl mdia m a cmmcial ln nk, i landlin cllula ands,

    imizd m snanus, cnvsainal sc ln calls.

    Nxidia sus m an 30 languags including:

    Duc

    englis (N Amican, UK, and Ausalian)

    Fnc (euan, Canadian)

    hindi

    Gman

    Jaans

    Kan

    Mandain

    russian

    Sanis (Lain Amican, Casilian)

    tai

    Addiinal languags a cnsanl in dvlmn. I icall aks lss an a ks dvl a languag ack

    a n languag.

  • 7/30/2019 White Paper Phonetic Search Tech

    7/17

    2008 Nxidia Inc. All igs svd. All admaks a i sciv ns. Nxidia ducs a cd b cigs

    and n m lling Unid Sas ans: 7,231,351; 7,263,484; 7,313,521; 7,324,939; 7,406,415 and ans nding. 7

    whItepAper

    pnic Sac tcnlg

    pnic sac ack

    t nd sul nic indxing an audi l is caing a pnic Audi tack (pAt l)a igl cmssd

    snain nic cnn inu sc. Unlik LVCSr, s ssnial us is mak ivsibl

    (and ssibl incc) bindings bn sc sunds and scic ds, nic indxing ml ins liklid

    nial nic cnn as a ducd laic, ding dcisins abu d bindings subsqun sacing as.

    pAt ls a siml ls a can b ad as madaa, assciad and disibud i iginaing mdia sgmns,

    ducd in n nvinmn, sd in daa bass, ansmid via nks, and sacd in an nvinmn. t pAt

    l gs in siz inal lng in im suc mdia l, a aund 3.7 MB u, quivaln a bi a

    8.6 kbs, i.., 2/3 a GSM ln audi (13 kbs) 1/15 a ical Mp3 (128 kbs).

    KeyworD pArSING

    t sacing as bgins i asing qu sing, ic is scid as x cnaining n m:

    ds ass (.g., psidn Sum Cu Jusic),

    nic sings (.g., _B _Iy _t _Uw _B _Iy, six nms sning acnm B2B),

    mal as (.g., bain canc &15 cll n, sning ass skn iin 15 scnds ac ).

    A nic dicina is ncd ac d iin qu m accmmda unusual ds (s nunciains

    mus b andld sciall givn naual languag) as ll as v cmmn ds ( ic manc imizain is

    il). An d n und in dicina is n cssd b cnsuling a slling--sund cnv a gnas

    likl nic snains givn ds ga.

    SeArCh AND reSULtS LIStS

    A ds, ass, nic sings and mal as iin qu m a asd, acual sacing cmmncs.

    Mulil pAt ls can b scannd a ig sd duing a singl sac likl nic squncs (ssibl saad b

    ss scid b mal as) a clsl mac csnding sings nms in qu m. rcall a

    pAt ls ncd nial ss nms, n ivsibl bindings sunds. tus, macing algim is babilisic

    and uns mulil suls, ac as a 4-ul:

    pAt Fil ( idni mdia sgmn assciad i uaiv i)

    Sa tim os (bginning qu m iin mdia sgmn, accua n undd a scnd)

    end tim os (axima im s nd qu m)

    Cndnc Lvl (a qu m ccus as indicad, bn 0.0 and 1.0)

  • 7/30/2019 White Paper Phonetic Search Tech

    8/17

    2008 Nxidia Inc. All igs svd. All admaks a i sciv ns. Nxidia ducs a cd b cigs

    and n m lling Unid Sas ans: 7,231,351; 7,263,484; 7,313,521; 7,324,939; 7,406,415 and ans nding. 8

    whItepAper

    pnic Sac tcnlg

    evn duing sacing, ivsibl dcisins a snd. rsuls a siml numad, sd b cndnc lvl, i

    ms likl candidas lisd s. pscssing suls lis can b aumad. examl sagis includ ad slds

    (.g., ign suls bl 90% cndnc), ccunc cuning (.g., a mdia sgmn gs a b sc v addiinal

    insanc qu m) and naual languag cssing (ans nab ds and ass dning smanics).

    tical b sac ngins siv un mulil suls n s ag s a us can quickl idni n

    suls as i dsid cic. Similal, an cin us inac can b dvisd squnc aidl ug a nic

    sac suls lis, lisn bif ac n, dmin lvanc and nall slc n m uancs a m

    scic ciia. Dnding n availabl im and imanc ival, lis can b usd as dl as ncssa.

    StrUCtUreD QUerIeS

    In addiin ad-c sacs, Nxidia vids a m sisicad cnlg assis i cnxual sacs: a sucud

    qu. A sucud qu is simila a ni sa gamma a uld b ducd an aumaic sc cgniin

    ssm. examls as a AND, or, and ANDNot. Du scial dmain sac, sval lul xnsins a

    als vidd, suc as aacing im inds as. Simila Nxidias sandad ad-c nic sac, b scs

    and im ss a und. B cnsucing cmlx quis, cusms a abl asil gna dcumn classis in

    addiin jus dcing d as ccuncs. An xaml mig b idni man calls in a call cns aciv

    discuss blms i a ba. Sucud quis a siml i and av xssiv cau cmlx

    Blan and mal lainsis, as sn in lling xaml:

    Cun = or( cun, cica, ba)

    rsac = BeFore_3( l m, or( cck, d sm sac)))

    pblm = or( Im aaid, ununal, rsac)

    QUery = AND_10( Cun, pblm)

    ADVANtAGeS oF phoNetIC SeArChING

    t basic acicu nic sacing s sval k advanags v LVCSr and cnvninal d sing:

    Sd, accuac, scalabili. t indxing as dvs is limid im allmn nl cagizing inu sc sunds

    in nial ss nmsa an making ivsibl dcisins abu ds. tis aac svs ssibili

    ig accuac s a sacing as can mak b dcisins n snd i scic qu ms. Als,

    acicu saas indxing and sacing s a indxing nds b md nl nc (icall duing mdia

    ings) and laivl as ain (sacing) can b md as n as ncssa.

    on vcabula. LVCSr ssms can nl cgniz ds und in i lxicns. Man cmmn qu ms (suc as

    scializd minlg and nams l, lacs and ganizains) a icall mid m s lxicns (al k

    m small nug a LVCSrs can b xcud cs civl in al-im, and als bcaus s kinds qu ms a

    nabl unsabl as n minlg and nams a cnsanl vlving). pnic indxing is uncncnd abu suc linguisic

    issus, mainaining cmll n vcabula (, as m accual, n vcabula a all).

  • 7/30/2019 White Paper Phonetic Search Tech

    9/17

    2008 Nxidia Inc. All igs svd. All admaks a i sciv ns. Nxidia ducs a cd b cigs

    and n m lling Unid Sas ans: 7,231,351; 7,263,484; 7,313,521; 7,324,939; 7,406,415 and ans nding. 9

    whItepAper

    pnic Sac tcnlg

    L nal n ds. LVCSr lxicns can b udad i n minlg, nams, and ds. hv, is

    xacs a sius nal in ms cs nsibcaus ni mdia aciv mus n b cssd ug

    LVCSr cgniz n ds (an ain a icall xcus nl sligl as an al im a bs). Als,

    babiliis nd b assignd n ds, i b gussing i qunc cnx b aining a languag

    mdl a includs n ds. t dicina iin nic sacing acicu, n and, is cnsuld

    nl duing sacing as, ic is laivl as cmad indxing. Adding n ds incus nl an sac,

    and i is n unncssa add ds, sinc slling--sund ngin can andl ms cass aumaicall, uss

    can siml n sund-i-u vsins ds.

    pnic and inxac slling. p nams a aiculal usul qu msbu als aiculal dicul LVCSr,

    n nl bcaus ma n ccu in lxicn as dscibd abv, bu als bcaus n av mulil sllings

    (and an vaian ma b scid a sac im). wi nic sacing, xac slling is n quid. F xaml, a

    munainus gin in Nw Czcslvakia can indd b lcad b sciing Sudnland, bu Su Dan Land ill

    k as ll. tis advanag bcms cla i a nam a can b slld Qadda, Kadda, Quada, Kadda,

    Kadan ic culd b lcad b nic sacing.

    Us-dmind d sac. I a aicula d as is n skn clal, i backgund nis ins a

    a mmn, n LVCSr ill likl n cgniz sunds ccl. onc a dcisin is mad, cc inain

    is lssl ls subsqun sacs. pnic sacing v uns mulil suls, sd b cndnc lvl.

    t sunds a issu ma n b s (i ma n vn b in n 100), bu i is v likl in suls lis

    sm, aiculal i sm in d as is laivl unimdd b cannl aiacs. I nug im

    is availabl, and i ival is sucinl iman, n a mivad us (aidd b an cin uman inac) can

    dill as dl as ncssa. tis caabili is siml unavailabl i LVCSr.

    Amnabl aalll xcuin. t nic sacing acicu can ak ull advanag an aalll cssing

    accmmdains. F xaml, a cmu i dual csss can indx ic as as. Addiinall, pAt ls can b

    cssd in aalll b banks cmus sac m mdia uni im ( sac acks can b licad in

    sam imlmnain andl m quis v sam mdia).

    CUrreNt IMpLeMeNtAtIoN oF phoNetIC SeArChING

    Nxidia vids a ang duc ings su nds a id ang nvinmns. t ms basic m,

    calld Nxidia wkbnc, is a C++ lki a vids basic uncinali indxing and sacing n mdia l

    daa sams. t kbnc quis uss dvl i n nd--nd alicain. An xnsiv s saml cd

    is vidd assis uss in quickl adding nic-basd sac uncinali i alicains.

    t Nxidia enis Sc Inllignc (eSI) sluin is a ull sv-sd alicain a can b cngud

    aumaicall ings mdia ls m mulil sucs, sac an numb us-dnd m liss and quis, and

    analz s suls saisical ans. Iniiall dsignd as ingain in a cmmcial call cn, Nxidia

    eSI alls call cn as asil dmin sci cmlianc saisics, mni ic nds v im, and dill

    dn in call acivs scic ccunc dsid vns, all using an inuiiv b inac.

  • 7/30/2019 White Paper Phonetic Search Tech

    10/17

    2008 Nxidia Inc. All igs svd. All admaks a i sciv ns. Nxidia ducs a cd b cigs

    and n m lling Unid Sas ans: 7,231,351; 7,263,484; 7,313,521; 7,324,939; 7,406,415 and ans nding. 10

    whItepAper

    pnic Sac tcnlg

    Nxidia als s Nxidia eSI Dvls ediin (De), a b svics lki all a cusm-buil alicain dicl

    cnl and adminis an eSI insallain. Sinc eSI De lki uss b svics, alicains ma b dvld using

    viuall an dvlmn nvinmn, suc as Java, Visual Basic, c.

    o Nxidia ducs includ AudiFind, a sandaln dsk sluin idal -discv in lgal mak and audi

    nsics in gnal; Languag Assss, a b-basd sluin a aumaicall asssss nunciain and func call

    cn agn alicans; and a duc sui dsignd ic mdia mak a includs aumaic agging vid asss.

    All Nxidia ducs a dsignd vid ig manc b indxing and sac. on a ical 3.0 Ghz Dual pcss

    Dual C sv, mdia ls a indxd bn 82 and 340 ims as an al-im. onc pAt ls a ladd m

    disk in mm (rAM), sac sds v 1.5 millin ims as an al-im can b acivd ( quivalnl, m an

    400 us audi sacd in a scnd). t ngin is dsignd ak maximum advanag a muli-css ssm,suc a a dual css bx acivs nal dubl ugu a singl css cnguain, i minimal vad

    bn csss. Cmad alnaiv LVCSr aacs, Nxidia nic-basd sac ngin vids a lvl

    scalabili n acivabl b ssms.

    t Nxidia ngin cms i buil-in su a id vai cmmn audi mas, including pCM, -la, A-la,

    ADpCM, Mp3, Quicktim, wMA, g.723.1, g.729, g.726, Dialgic VoX, GSM and man s. Nxidia als vids a

    amk su cusm l-mas and dvics, suc as dic nk ds and ia cdcs, ug

    a vidd lug-in acicu.

    pmanc Nxidia pnic Sac

    t a k manc caacisics Nxidias pnic Sac: accuac suls, indx sd and sac

    sd. All a iman n valuaing an audi sac cnlg. tis scin ill dscib ac s in dail

    Nxidia nic-basd ngin.

    reSULt ACCUrACy

    pnic-basd sac suls a und as a lis uaiv i lcains, in dscnding liklid d. As a us gsss

    u dn is lis, ill nd m and m insancs i qu ccuing. hv, ill als vnuall

    ncun an incasing amun als alams (suls a d n csnd dsid sac m). tis manc

    caacisic is bs sn b a cuv cmmn in dcin : rciv oaing Caacisic cuv, roC cuv,

    sn in Figu 2 and Figu 3.

    t gna is cuv, n nds ximnal suls m sac ngin ( dd lis uaiv is) and idal

    suls s s (acquid b manual vi and dcumnain s daa). F audi sac, idal s is

    vbaim anscis a as skn in audi. F a singl qu, s numb acual ccuncs in idal

    ansci is cund. t roC cuv bgins a 0,0 in n ga Fals Alams hu vsus pbabili Dcin.

    rsuls m sac ngin a n xamind, bginning m lis. wn a uaiv i in lis macs

    ansci, dcin a incass, as cnag u ccuncs dcd as jus gn u ( cuv

  • 7/30/2019 White Paper Phonetic Search Tech

    11/17

    2008 Nxidia Inc. All igs svd. All admaks a i sciv ns. Nxidia ducs a cd b cigs

    and n m lling Unid Sas ans: 7,231,351; 7,263,484; 7,313,521; 7,324,939; 7,406,415 and ans nding. 11

    whItepAper

    pnic Sac tcnlg

    gs u). wn i is n a mac, als alam a n incass ( cuv n mvs ig). tis cninus

    unil als alam a acs a -dnd sld. F an singl qu in gnic sc, is cuv nmall as v

    ins, sinc sam as ill nl an a ims, unlss sam ic is bing discussd v and v in

    daabas. t duc a maningul roC cuv, usands quis a sd i suls avagd g, gnaing

    sm, and saisicall signican, roC cuvs.

    t a maj caacisics a ac babili dcin an givn qu.

    1 audi bing sacd; and

    2 lng and nm cmsiin sac ms mslvs.

    t addss s issu, Nxidia vids languag acks ac languag, n dsignd sac badcas-qualimdia and an ln-quali audi. t roC cuvs N Amican englis in badcas and ln a

    sn in Figus 2 and 3 scivl.

    Figure 2

    ROC Curves for the North

    American broadcast language pack

  • 7/30/2019 White Paper Phonetic Search Tech

    12/17

    2008 Nxidia Inc. All igs svd. All admaks a i sciv ns. Nxidia ducs a cd b cigs

    and n m lling Unid Sas ans: 7,231,351; 7,263,484; 7,313,521; 7,324,939; 7,406,415 and ans nding. 12

    whItepAper

    pnic Sac tcnlg

    F xaml, using N Amican englis badcas languag ack and a qu lng 1215 nms, u

    can xc, n avag, nd 85% u ccuncs, i lss an n als i 2 us mdia sacd.

    t Nxidia ngin vids an alicain fxibili cs nsu a ig babili dcin, b accing

    suls i a mda cndnc sc, duc als alams (uaiv suls ill av a ig babili bing

    an acual dsid sul), b aising sc sld and nl accing s i ig cndnc scs.

    In a d-sing ssm suc as Nxidia, m nms in qu man m disciminaiv inmain is availabl a

    sac im. As sn b u cuvs in gus sning u din gus qu lngs, dinc can

    b damaic. Funal, a an s, singl d quis (suc as n ), ms al-ld sacs a

    nams, ass, insing sc a sn lng nm squncs. .

    F badcas suls, s s is a n-u slcin ABC, CNN, and nscass, ssinall anscibdand ud b Linguisic Daa Cnsium (LDC). F ln, s s is a 10-u subs Sicbad and

    Sicbad Cllula ca, als availabl m LDC. Qu ms gnad b numaing all ssibl d

    and as squncs in anscis, and andml csing aund n usand m is s.

    Figure 3

    ROC Curves for the North

    American telephony language pack

  • 7/30/2019 White Paper Phonetic Search Tech

    13/17

    2008 Nxidia Inc. All igs svd. All admaks a i sciv ns. Nxidia ducs a cd b cigs

    and n m lling Unid Sas ans: 7,231,351; 7,263,484; 7,313,521; 7,324,939; 7,406,415 and ans nding. 13

    whItepAper

    pnic Sac tcnlg

    Figure 4

    Search speed, in hours of media searched per

    second of CPU time, for a range of query lengths

    INDeXING SpeeD

    An signican mic Nxidias nic sac is indxing sd (sd a ic n mdia can b mad sacabl).

    tis is a cla advanag Nxidia, as ngin ingss mdia v aidl. Fm call cns i undds sas, mdia

    acivs i ns usands us, andld dvics i limid CpU and sucs, is sd is a ima

    cncn, as is las dicl inasucu cs.

    Indxing quis a laivl cnsan amun cmuain mdia u, unlss a aicula audi sgmn is msl

    silnc, in ic cas indxing as a vn ga. In s-cas scnai a call cn badcas cding

    a cnains msl nn-silnc, ings sds a sv-class pC a givn bl in tabl 1.

    ts sds indica a indxing im 1,000 us mdia is lss an 1 u al im. pu an a,

    a singl sv a ull caaci can indx v 30,000 mdia da.

    ts suls a audi sulid in lina pCM -la ma indxing ngin. I audi is sulid in an ma

    suc as Mp3, wMA, GSM, ill b a small amun ma-dndn vad dcd cmssd audi.

    SeArCh SpeeD

    A nal manc masu is sd a ic mdia can b sacd nc i as bn indxd. t main acs infunc

    sd sacing. t ms iman ac is pAt ls a in mm n disk. onc an alicain

  • 7/30/2019 White Paper Phonetic Search Tech

    14/17

    2008 Nxidia Inc. All igs svd. All admaks a i sciv ns. Nxidia ducs a cd b cigs

    and n m lling Unid Sas ans: 7,231,351; 7,263,484; 7,313,521; 7,324,939; 7,406,415 and ans nding. 14

    whItepAper

    pnic Sac tcnlg

    quss a sac ack b ladd (i i xcs i b ndd sn), ls un s sac a ack, Nxidia

    sac ngin ill lad i in mm. An subsqun sacing ill us is in-mm vsin, gal sding u n

    sam mdia is sacd mulil ims.

    A scnd ac infuncing sac sd is lng, in nms, d as in qu. S quis un

    as, as a calculains mak innal sac ngin.

    tabl 2 bl ss sac sds a ail avag (12 nms lng) qu v a lag s in-mm pAt ls,

    xcud n a sv-class pC.

    Alicains pnic Sac

    t nic sac cnlg snd in is a as alad und man alicains and is alicabl man m.

    A bi summa cun and nial uss is:

    Call cn daa mining. Aumaicall sac cdd acivs in call-cns usul and niall abl

    inmain, suc as call nd analss, nding blm aas in IVrs, aud dcin, and uss.

    SeArCh SpeeD*

    667,210

    SerVer UtILIZAtIoN

    > 12.5% (single thread, only one CPU core used)

    5,068,783 > 100% (8 threads, one thread per CPU core)

    Table 1 Search speed (* in times faster than real-time) for a 12-phoneme query on a 2-processor, 4-core server

    (Dell PowerEdge 2950, 2 x 3.16 GHz X5460 Quad Core, 4 GB RAM, 2 x 6 MB cache, 1.33 GHz FSB)

    INDeXING SpeeD*

    190

    SerVer UtILIZAtIoN

    > 12.5% (single thread, only one CPU core used)

    1,306 > 100% (8 threads, one thread per CPU core)

    Table 1 Indexing Speed (*in times faster than real time) Indexing speed on a 2-processor, 4-core server

    (Dell PowerEdge 2950, 2 x 3.16 GHz X5460 Quad Core, 4 GB RAM, 2 x 6 MB cache, 1.33 GHz FSB)

  • 7/30/2019 White Paper Phonetic Search Tech

    15/17

    2008 Nxidia Inc. All igs svd. All admaks a i sciv ns. Nxidia ducs a cd b cigs

    and n m lling Unid Sas ans: 7,231,351; 7,263,484; 7,313,521; 7,324,939; 7,406,415 and ans nding. 15

    whItepAper

    pnic Sac tcnlg

    Call cn quali cnl. rduc xnss assciad i manual vsig CSr sci cmlianc. Fu, all

    sci cmlianc analsis acss all calls, all sas a cn, a an manual samling a small cnag.

    Sacabl vic mail. Man l av bgun using i mail lds as i ling ssm, kning x sac can lad

    m dsid addsss, ns, discussins, and inmain. pnic sac n alls simila caabiliis vicmail.

    ral-im mdia sac. Nxidias nic sac can un in mni md un suls i lss an 1 scnd

    lanc n mniing u 1,000 simulanus audi sams sv.

    Acivd mdia sac. wi Nxidias nic sac, i is ssibl s nd a dcas, lcu, gam ins,

    and scnd, immdial jum in in cding alking abu dsid ic.

    Nain, dsiin and invi su. In scnais xnsiv ansciin is n availabl, Nxidias as indxingand xml as sac alls k maks b quickl und.

    Sacabl ns andld uss. Nxidias sac is ligig nug asil un n andld dvics. Quick ns

    n lng av b in dninsad, n can jus sac audi isl.

    wd as dcin. Ligig sac can asil nam-dialing, cmmand-and-cnl, small alica-

    ins cunl andld b small sc cgniin ngins.

    Cnclusins

    tis a as givn an vvi nic sac cnlg dvld a Nxidia. t md baks sacing in

    sags: indxing and sacing. t indx sag ans nl nc mdia l, and is xml as, a m an 1,000as an al-im n sandad pC ada. ta l can n b sacd indndnl an numb ims, a a a

    m an 5,000,000 ims as an al im. Sac quis can b ds, ass, vn sucud quis a all

    as suc as AND, or, and im cnsains n gus ds. Sac suls a liss im ss in ls, i an

    accmaning sc giving liklid a a mac qu and a is im.

    pnic sacing as sval advanags v vius mds sacing audi mdia. B n cnsaining nun-

    ciain sacs, an nam, slang, vn ds a av bn inccl slld can b und, cmll aviding

    u--vcabula blms sc cgniin ssms. pnic sac is als as. F dlmns suc as call

    cns i ns usands us audi da, dcisin n slcing a subs analz nd n b mad, sinc

    i vn mds sucs all cdings can b indxd sac. Unlik aacs, Nxidias sac cnlg

    is v scalabl, alling as and cin sacing and analsis xml lag audi acivs.

  • 7/30/2019 White Paper Phonetic Search Tech

    16/17

    2008 Nxidia Inc. All igs svd. All admaks a i sciv ns. Nxidia ducs a cd b cigs

    and n m lling Unid Sas ans: 7,231,351; 7,263,484; 7,313,521; 7,324,939; 7,406,415 and ans nding. 16

    whItepAper

    pnic Sac tcnlg

    rncs

    [Cang] e. I. Cang and r. p. Limann, Imving wdsing pmanc i Aiciall Gnad Daa, in pcdings

    Ieee Innainal Cnnc n Acusics, Sc and Signal pcssing, Alana, GA, Vl. 1, 283-286, 1996.

    [Ci] J. Ci, D. hindl, J. hisbg, I. Magin-Cagnllau, C. Kakaani, F. pia, A. Singal, and S. wiak, SCAN

    Sc Cnn Basd Audi Naviga: A Ssms ovvi, pcdings In. Cn. n Skn Languag pcssing, 1998.

    [Clmns al. 2001a] M. Clmns, p. Cadill, M. Mill, pnic Sacing Digial Audi, NAB Badcas engining

    Cnnc, Las Vgas, NV, Ail 2001.

    [Clmns al. 2001b] M. Clmns, p. Cadill, M. Mill, pnic Sacing vs. LVCSr: h Find wa yu rall wan

    in Audi Acivs, AVIoS, San Js, CA, Ail 2001.

    [Clmns al. 2007] M. Clmns and M. Gavalda, Vic/Audi Inmain rival: Minimizing Nd human

    eas, pcdings Ieee ASrU, K, Jaan. Dcmb 2007.

    [Gal] J. Gal, C. Auzann, and e. Vs, t treC Skn Dcumn rival tack: A Succss S,

    pcdings treC-8, 107-116, Gaisbug, MD, Nv. 1999.

    [Ga] D. Ga, Z. wu, r. McIn, and M. Libman, t 1996 Badcas Ns Sc and Languag-Mdl Cus,

    pcdings 1997 DArpA Sc rcgniin wks, 1997.

    [IBM] ://-4.ibm.cm/sa/sc, ViaVic.

    [Jams] D. A. Jams and S. J. yung, A Fas Laic-Basd Aac Vcabula Indndn wdsing, in pcdings

    Ieee Innainal Cnnc n Acusics, Sc and Signal pcssing, Adlais, SA, Ausalia, Vl. 1, 377-380, 1994.

    [Jnsn] S.e. Jnsn, p.C. wdland, p. Julin, and K. Sk Jns, Skn Dcumn rival treC-8 a Cambidg

    Univsi, pcdings treC-8, 197-206, Gaisbug, MD, Nv. 1999.

    [Juask] D. Juask and J. Main, Sc and Languag pcssing, pnic-hall, 2000.

    [Mics] X. huang, A. Ac, F. Allva, M. hang, L. Jiang, and M. Maajan, Mics winds higl Inllign Sc

    rcgniz: wis, pcdings ICASSp 95, vlum 1, 93-97.

    [Ng] K. Ng and V. Zu, pnic rcgniin Skn Dcumn rival, pcdings ICASSp 98, Sal, wA, 1998.

    [pilis] ://.sc.b.ilis.cm, Sc pal.

    [Saukkai] r. r. Saukkai and D. h. Ballad, pnic S Indxing Fas Lxical Accss, Ieee tansacins n pan

    Analsis and Macin Inllignc, Vl. 20, n. 1, 78-82, Janua, 1998.

    [Viag] ://.viag.cm, VidLgg and AudiLgg.

  • 7/30/2019 White Paper Phonetic Search Tech

    17/17

    2008 Nxidia Inc. All igs svd. All admaks a i sciv ns. Nxidia ducs a cd b cigs

    and n m ll ing Unid Sas ans 7 231 351 7 263 484 7 313 521 7 324 939 7 406 415 and ans nding 17

    whItepAper

    pnic Sac tcnlg

    [wiln] J. wiln, L. rabin, L. L, and e. Gldman, Aumaic rcgniin Kds in Uncnsaind Sc Using

    hiddn Makv Mdls, Ieee tansacins n Acusics, Sc, and Signal pcssing, Vl. 38, n. 11, 1870-1878,

    Nvmb, 1990.

    [wld] r. wld, A. Smi, and M. Sambu, t enancmn wdsing tcniqus, in pcdings Ieee

    Innainal Cnnc n Acusics, Sc and Signal pcssing, Dnv, Co, Vl. 1, 209-212, 1980.

    [yu] p yu, K. Cn, C. Ma, and F. Sid, Vcabula-Indndn Indxing Snanus Sc, Ieee tansacins

    n Sc and Audi pcssing, vlum 13, n. 5, Smb 2005.

    Nexidia Inc.

    3565 pidmn rad Ne

    Building t, Sui 400

    Alana, GA 30305

    404.495.7220 l

    404.495.7221 ax

    866.355.1241 ll- nxidia.cm