33
updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/ goh kawai 2013-04-09 tue1 week1 spoken language corpora s316 Spoken language corpora Course overview

Updated 2013-04-07 03:20 utc [email protected] goh kawai 2013-04-09 tue1 week1 spoken language corpora s316 Spoken language corpora Course

Embed Size (px)

Citation preview

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

goh kawai2013-04-09 tue1 week1

spoken language corporas316

Spoken language corporaCourse overview

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

goh do this for tue1

bring and connect laptop, projector, network, bluetooth speaker, clicker

arrange desks, chairs show these slides, my website, glexa circulate roster sheet

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

make roster

write full name furigana email address

pass sheet

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

informed consent

your speech and actions may be recorded, archived and, without revealing your identity, used and made public for research and education purposes

if you disagree, I will neither record nor retaliate学生の言動を録音し、保存し、匿名としたう

えで研究と教育のために利用したり公開する可能性がある

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

contact info

office: office building room s304 email: [email protected] web: goh.kawai.com

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

goh's website

http://goh.kawai.com/ http://goh.cll.hokudai.ac.jp/

identical content hokudai site may be faster

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

instructor

Goh Kawai ( 河合 剛 かわい ごう ) born in Tokyo, raised in Toronto came to Sapporo in 2003-04

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

goh’s academic background

Univ of Tokyo BA linguistics, 1984

ICU MA educational technology, 1986

Stanford Univ linguistics (dropout)

Univ of Tokyo PhD information and communication

engineering, 1999

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

goh’s vocational background

Xerox Palo Alto Research CenterPalo Alto, CA

SRI International Menlo Park, CA

University of Tokyo Tokyo, Japan

University of California Santa Cruz Santa Cruz, CA

Oregon Health & Science University Beaverton, OR

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

goh’s interests

research spoken and written language

processing technology applied to language learning

personal interests flying, kayaking, cycling, snowshoeing,

amateur radio, sado (way of tea)

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

office hours

drop-in or email for appointment no phone calls

off campus see my website

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

class periods

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

grad school catalog blurb

担当分野/マルチメディア言語情報処理論 研究領域、学歴 ( 言語学学士、教育学修士、電子情報工学博

士 ) 、職歴 ( 研究所 2 社、大学 4 校 ) 、業績一覧、所属学会、授業資料、教え子の匿名コメント ( 全ての学部授業 ) などを web に掲載。メールで面会予約。電話不可。私の評価を元指導生に直接たずねるとよい。

言語情報処理、教育工学☆領域 言語学と情報処理技術を利用した非母語学習。☆手法 学習システムや教材を制作し、学習効果を定量的に評価する。☆指導方法 協同プロジェクトを共著論文にまとめる。☆修士条件 査読のある国際会議で論文発表。☆博士条件 後進の研究指導。☆指導生の発表先 音響学会、音声学会、教育工学会、 ASA, AAAL, Calico, Eurocall, Interspeech など。

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

alumni

平野宏子 東京大学 博士 ( 科学 )東北師範大学

歌代崇史 東京工業大学 博士 ( 工学 )北海学園大学

三角美樹 札幌開成高校 壽崎尚美 北海道立高校 片桐徳昭 札幌開成高校、博士 ( 学術 ) 見

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

undergraduate education

english language for freshmen online course instructor-led courses

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

english online

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

instructor-led course

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

pronunciation lunch

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

spoken language corpora course

acquire a specific practical skillnot theory lots of out-of-class work

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

objectives

re: spoken language corpora, explain: basic concepts (definitions, features) uses (analysis, engineering, learning) design and development strategies

re: speech analysis, perform: design and collect corpus label and analyze speech interpret analyses

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

prerequisites

phonetics and phonology sound system of English and/or Japanese IPA desirable

audio input and output using computers bring your laptop (Linux, Windows, Mac)

statistics mean, standard deviation

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

format of each class period

explain concepts and theory collect and analyze speech

learn software tools transcribe and analyze design corpus

learn about research and academia explain next week's assignment

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

grading

discussion and project

100% essential

participate in discussion during class propose and report your project

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

schedule

wk date activity

1 2013-04-09 install software

2 2013-04-16 transcribe speech

3 2013-04-23 record read speech

4 2013-05-07 record spontaneous speech

5 2013-05-14 design L1 script

6 2013-05-21 design L1 script

7 2013-05-28 design L2 script

8 2013-06-04 design L2 script

attendance mandatory

wk date activity

9 2013-06-11 propose project

10 2013-06-18 propose project

11 2013-06-25 report progress

12 2013-06-26 report progress

13 2013-07-02 report project

14 2013-07-09 report project

15 2013-07-16 critique

16 2013-07-23

probably no class (make up day)

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

courseware

everything online reading material lecture notes (including this presentation)

http://goh.kawai.com/ http://goh.cll.hokudai.ac.jp/

hokudai library catalog of our course's textbooks view online course offering ( シラバス )

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

Praat

http://www.praat.org/ built by researchers and engineers in

linguistics and speech processing updated frequently good support base Windows, Mac, Linux free

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

what can Praat do?

record and play speechdisplay waveforms, spectrograms, pitch and

more label speech at various levels phone, mora, syllable, word, phrase and

utterance levelsSIL fontsPraat in action

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

demo

view praat time waveform spectogram spectral slice

sound sources show praat vowels consonants pure tones (sinusoids)

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

readings

Jurafsky et al (2000) chapter 4

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

next week

install Praat TIMIT sentences

download from my website extract speech files from archive read files into Praat play speech view waveforms and spectograms label at the word level

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

slideshow

if there's time

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

one-stop website

http://goh.kawai.com/

link to glexa course material (these slides) contact form

updated 2013-04-07 03:20 utc [email protected] http://goh.kawai.com/

see you next week!

mailto:[email protected] http://goh.kawai.com/