Lecture 1-Spring 2016 (1)

  • Upload
    ericly

  • View
    220

  • Download
    0

Embed Size (px)

Citation preview

  • 7/26/2019 Lecture 1-Spring 2016 (1)

    1/12

    CS 582Intro to Speech Processing

    Chuck Konopka [email protected]

    M W 2:!":#5pm $%& '"(

    )ecture I * &dministration+,rgani-ation &n Introduction to the /opic

    Wed. #.2#.#5

  • 7/26/2019 Lecture 1-Spring 2016 (1)

    2/12

    0rading Criteria

    " home1ork assignments: #5

    # take home midterm e3am: 2

    # take home 4inal e3am: "# semester team proect: "5

    $3tra Credit ,pportunities: 6p to #

  • 7/26/2019 Lecture 1-Spring 2016 (1)

    3/12

    Maor /opics

    Modeling &coustic /heor7 o4 Speech Production and Perception

    9o1 1e model the speech and hearing processes

    &coustic!Phonetics 9o1 1e model acoustic units o4 speech

    /ime!;re

  • 7/26/2019 Lecture 1-Spring 2016 (1)

    4/12

    S7lla?usWeeks #!8

    Week Subject

    1 Course Introduction /he >eall7 %ig Picture: What is ModelingA Bemonstration+)a?: 6sing the CS)6 /oolkit to ?uild a simple 1orking speech

    recognition s7stem Selection o4 Semester Proect #!2 pages

    2-3 The Big Picture /he ph7sical model o4 speech recognition: Speech production and perception Beri=ing a computational model o4 speech recognition 4rom the ph7sical model

    4-6 !chine "e!rning Super=ised )earning 6nsuper=ised )earning Machine )earning )a? &n application o4 Matla? or a=a!?ased so4t1are to a learning

    pro?lem

    # idter$ re%ie& !nd e'!$ /ake!home midterm e3am Midterm report on Semester Proect progress #!2 pages

  • 7/26/2019 Lecture 1-Spring 2016 (1)

    5/12

    S7lla?usWeeks (!#D

    Week Subject

    (-1) *idden !rko% ode+s /he E4amousF " lectures 9MM )a?: Simple implementations o4 ke7 9MM algorithms

    11-14 S,eech ,re-,rocessing &n introduction to /ime!4re

  • 7/26/2019 Lecture 1-Spring 2016 (1)

    6/12

    Speech Processing &pplications

    $3amples o4 speech processing applications include: Speech recognition Bragon Bictate S&PI Microso4tGs Speech &PI CS>)6 /oolkit

    Speech s7nthesis CS>)6 /oolkit

    &/H/ atural oices Speech e44ects

    pitch ?ending Chorus e44ects

    0rammar modeling S7nthetic Shakespeare

    Speaker recognition &coustic %iometrics &ccent recognition )anguage training

  • 7/26/2019 Lecture 1-Spring 2016 (1)

    7/12

    >esources/hings 7ou 1ill need

    /e3t?ook:

    Speech &nd )anguage Processing 2nd EditionE

    ura4sk7 H Martin Prentice 9all 2(

    Matla?+,cta=e &udacit7 a=a CJJ etc.

    arious papers to ?e announced

  • 7/26/2019 Lecture 1-Spring 2016 (1)

    8/12

    /he Semester Proect

    /he goal o4 the Semester Proect is to apply and generalizethe presented concepts ?7 de=eloping a

    E%ig IdeaF in a team setting.

    E%ig Ideas Some e3amples o4 Prior Semester Proects: S7nthetic Shakespeare

    /he Cocktail Part7 $44ect

    Concatenati=e Speech S7nthesis

    Prosod7 Betection H S7nthesis

    &ccent >ecognition

    9armon7 0eneration

    $motion >ecognition

    S7nthetic %eatles %eetho=en etc.

    &long the 1a7 7ouGll: Be=elop the E%ig IdeaF into something 7ou can implement

    Be=elop research and 1riting skills Be=elop team ?uilding and coordination skills

  • 7/26/2019 Lecture 1-Spring 2016 (1)

    9/12

    $3amples o4 Past Semester Proects

    S7nthetic Shakespeare

    /he Cocktail Part7 $44ect

    Concatenati=e Speech S7nthesis

    Prosod7 Betection H S7nthesis

    &ccent >ecognition

    9armon7 0eneration

    $motion >ecognition S7nthetic %eatles %eetho=en etc.

  • 7/26/2019 Lecture 1-Spring 2016 (1)

    10/12

    In Brie/

    /his course 1ill introduce 7ou to the 4undamentals o4speech processing and ho1 these concepts can?e applied to other pro?lem domains.

  • 7/26/2019 Lecture 1-Spring 2016 (1)

    11/12

    The Big Ide!

    & per4ect understanding o4 ho1 1e understand speech

    isnGt re

  • 7/26/2019 Lecture 1-Spring 2016 (1)

    12/12

    uick %er%ie&

    !ture5s ode+ WeGll ?egin 1ith a de4inition o4 a model. WeGll then take a

    look at the ?iological models o4 speech production andperception that ser=e as the ?asis 4or the computationalmodels o4 speech.

    Co$,ut!tion!+ ode+ ,nce 1e understand the Eatural ModelF 1eGll proceed to

    de=elop a computational model.*o& WeGll de=elop a hierarch7 o4 the ?uilding ?locks o4 speech

    and then ?uild a s7stem using these components. /hese elements are:

    /he acoustic audio elements /he phonetic elements /he structure o4 speech E0rammarF /he meaning in speech i.e. EsemanticsF