Upload
ericly
View
220
Download
0
Embed Size (px)
Citation preview
7/26/2019 Lecture 1-Spring 2016 (1)
1/12
CS 582Intro to Speech Processing
Chuck Konopka [email protected]
M W 2:!":#5pm $%& '"(
)ecture I * &dministration+,rgani-ation &n Introduction to the /opic
Wed. #.2#.#5
7/26/2019 Lecture 1-Spring 2016 (1)
2/12
0rading Criteria
" home1ork assignments: #5
# take home midterm e3am: 2
# take home 4inal e3am: "# semester team proect: "5
$3tra Credit ,pportunities: 6p to #
7/26/2019 Lecture 1-Spring 2016 (1)
3/12
Maor /opics
Modeling &coustic /heor7 o4 Speech Production and Perception
9o1 1e model the speech and hearing processes
&coustic!Phonetics 9o1 1e model acoustic units o4 speech
/ime!;re
7/26/2019 Lecture 1-Spring 2016 (1)
4/12
S7lla?usWeeks #!8
Week Subject
1 Course Introduction /he >eall7 %ig Picture: What is ModelingA Bemonstration+)a?: 6sing the CS)6 /oolkit to ?uild a simple 1orking speech
recognition s7stem Selection o4 Semester Proect #!2 pages
2-3 The Big Picture /he ph7sical model o4 speech recognition: Speech production and perception Beri=ing a computational model o4 speech recognition 4rom the ph7sical model
4-6 !chine "e!rning Super=ised )earning 6nsuper=ised )earning Machine )earning )a? &n application o4 Matla? or a=a!?ased so4t1are to a learning
pro?lem
# idter$ re%ie& !nd e'!$ /ake!home midterm e3am Midterm report on Semester Proect progress #!2 pages
7/26/2019 Lecture 1-Spring 2016 (1)
5/12
S7lla?usWeeks (!#D
Week Subject
(-1) *idden !rko% ode+s /he E4amousF " lectures 9MM )a?: Simple implementations o4 ke7 9MM algorithms
11-14 S,eech ,re-,rocessing &n introduction to /ime!4re
7/26/2019 Lecture 1-Spring 2016 (1)
6/12
Speech Processing &pplications
$3amples o4 speech processing applications include: Speech recognition Bragon Bictate S&PI Microso4tGs Speech &PI CS>)6 /oolkit
Speech s7nthesis CS>)6 /oolkit
&/H/ atural oices Speech e44ects
pitch ?ending Chorus e44ects
0rammar modeling S7nthetic Shakespeare
Speaker recognition &coustic %iometrics &ccent recognition )anguage training
7/26/2019 Lecture 1-Spring 2016 (1)
7/12
>esources/hings 7ou 1ill need
/e3t?ook:
Speech &nd )anguage Processing 2nd EditionE
ura4sk7 H Martin Prentice 9all 2(
Matla?+,cta=e &udacit7 a=a CJJ etc.
arious papers to ?e announced
7/26/2019 Lecture 1-Spring 2016 (1)
8/12
/he Semester Proect
/he goal o4 the Semester Proect is to apply and generalizethe presented concepts ?7 de=eloping a
E%ig IdeaF in a team setting.
E%ig Ideas Some e3amples o4 Prior Semester Proects: S7nthetic Shakespeare
/he Cocktail Part7 $44ect
Concatenati=e Speech S7nthesis
Prosod7 Betection H S7nthesis
&ccent >ecognition
9armon7 0eneration
$motion >ecognition
S7nthetic %eatles %eetho=en etc.
&long the 1a7 7ouGll: Be=elop the E%ig IdeaF into something 7ou can implement
Be=elop research and 1riting skills Be=elop team ?uilding and coordination skills
7/26/2019 Lecture 1-Spring 2016 (1)
9/12
$3amples o4 Past Semester Proects
S7nthetic Shakespeare
/he Cocktail Part7 $44ect
Concatenati=e Speech S7nthesis
Prosod7 Betection H S7nthesis
&ccent >ecognition
9armon7 0eneration
$motion >ecognition S7nthetic %eatles %eetho=en etc.
7/26/2019 Lecture 1-Spring 2016 (1)
10/12
In Brie/
/his course 1ill introduce 7ou to the 4undamentals o4speech processing and ho1 these concepts can?e applied to other pro?lem domains.
7/26/2019 Lecture 1-Spring 2016 (1)
11/12
The Big Ide!
& per4ect understanding o4 ho1 1e understand speech
isnGt re
7/26/2019 Lecture 1-Spring 2016 (1)
12/12
uick %er%ie&
!ture5s ode+ WeGll ?egin 1ith a de4inition o4 a model. WeGll then take a
look at the ?iological models o4 speech production andperception that ser=e as the ?asis 4or the computationalmodels o4 speech.
Co$,ut!tion!+ ode+ ,nce 1e understand the Eatural ModelF 1eGll proceed to
de=elop a computational model.*o& WeGll de=elop a hierarch7 o4 the ?uilding ?locks o4 speech
and then ?uild a s7stem using these components. /hese elements are:
/he acoustic audio elements /he phonetic elements /he structure o4 speech E0rammarF /he meaning in speech i.e. EsemanticsF