18
Audio Audio Workgroup Workgroup Neuro-inspired Speech Neuro-inspired Speech Recognition Recognition

Audio Workgroup Neuro-inspired Speech Recognition

Embed Size (px)

Citation preview

Page 1: Audio Workgroup Neuro-inspired Speech Recognition

Audio WorkgroupAudio Workgroup

Neuro-inspired Speech RecognitionNeuro-inspired Speech Recognition

Page 2: Audio Workgroup Neuro-inspired Speech Recognition

Audio WorkgroupAudio Workgroup

Localization EffortLocalization Effort

Interaural Time Difference (ITD)

Estimated from time difference between spikes of two matching channels.

Interaural Intensity Difference (IID)

Difference of spike counts between two cochleae.

Azimuth: Combination of ITD and ILD

Page 3: Audio Workgroup Neuro-inspired Speech Recognition

Audio WorkgroupAudio Workgroup

Localization EffortLocalization Effort

Page 4: Audio Workgroup Neuro-inspired Speech Recognition

Audio WorkgroupAudio Workgroup

Relational Network (Simple)Relational Network (Simple)

X Y

Z

MM

X

M

Y

M

Z

m

Patches of neuronsEach measureone quantityBidirectionalrelations for feedback/feedforward

Page 5: Audio Workgroup Neuro-inspired Speech Recognition

Audio WorkgroupAudio Workgroup

Relational Network (example)Relational Network (example)

Input hereRelation specification

Relational feedback

RelationFeedback

Page 6: Audio Workgroup Neuro-inspired Speech Recognition

Audio WorkgroupAudio Workgroup

ASR Relational NetworkASR Relational Network

Cochlea

Delay

Phone Recognizer

Phone Recognizer

Word Recognizer

A patch of neurons(one of N output)

We don’t know how to represent time

Page 7: Audio Workgroup Neuro-inspired Speech Recognition

Audio WorkgroupAudio Workgroup

ASR AdvantagesASR Advantages

Not an HMM

Top-Down, Bottom-Up Hypothesis

Hallucinate

Page 8: Audio Workgroup Neuro-inspired Speech Recognition

Audio WorkgroupAudio Workgroup

Silicon CochleaSilicon Cochlea

Ganglion cells

Basilar membrane

highfrequency

lowfrequency

Inner hair cells

(van Schaik, Liu, 2004)

BASILAR MEMBRANE

INNER HAIR CELLS

GANGLION CELLS

Page 9: Audio Workgroup Neuro-inspired Speech Recognition

Audio WorkgroupAudio Workgroup

Silicon CochleaSilicon Cochlea

Tone raster plots

Vowel Rate Profiles

Page 10: Audio Workgroup Neuro-inspired Speech Recognition

Audio WorkgroupAudio Workgroup

Learning ChipLearning Chip

Architecture

Tone Rasters?

Vowel Rasters

Learning Algorithm

Alternative LearningStatistics

LeastSquares

Page 11: Audio Workgroup Neuro-inspired Speech Recognition

Audio WorkgroupAudio Workgroup

LSM RecognizerLSM Recognizer

Page 12: Audio Workgroup Neuro-inspired Speech Recognition

Audio WorkgroupAudio Workgroup

Infrastruture DifficultiesInfrastruture Difficulties

RemapperReplace with Matlab

Power ?

Sharing chips?

PC replacement

Page 13: Audio Workgroup Neuro-inspired Speech Recognition

Audio WorkgroupAudio Workgroup

FPAA/MoteFPAA/Mote

Page 14: Audio Workgroup Neuro-inspired Speech Recognition

Audio WorkgroupAudio Workgroup

Word RecognizerWord Recognizer

Four example raster plot (silence, A_, A_ with relational, AI)

Page 15: Audio Workgroup Neuro-inspired Speech Recognition

Audio WorkgroupAudio Workgroup

QuickTime™ and aTIFF (Uncompressed) decompressor

are needed to see this picture.

Software SimulationSoftware Simulation

Page 16: Audio Workgroup Neuro-inspired Speech Recognition

Audio WorkgroupAudio Workgroup

Behind the CurtainBehind the Curtain

Page 17: Audio Workgroup Neuro-inspired Speech Recognition

Audio WorkgroupAudio Workgroup

Hardware OverviewHardware Overview

Cochlea

Cochlea

Remapper(in Matlab)

Learning

GiacomoGiacomo

PhonemeWord

skype

PCI-AER (for remapping)

PCI-AER (for remapping)

Page 18: Audio Workgroup Neuro-inspired Speech Recognition

Audio WorkgroupAudio Workgroup