Keynote speech, ICUWB 2010, September 20-23, 2010, A Physiologically Produced Impulsive UWB signal: Speech Maria-Gabriella Di Benedetto University of Rome La Sapienza Faculty of Engineering Rome, Italy [email protected] http://acts.ing.uniroma1.it


Keynote speech, ICUWB 2010, September 20-23, 2010, Nanjing, China.

A Physiologically Produced Impulsive UWB signal: Speech

Maria-Gabriella Di Benedetto

University of Rome La Sapienza

Faculty of Engineering

Rome, Italy

[email protected]

http://acts.ing.uniroma1.it


Observation: many physiologically produced signals are impulsive in nature

Their waveforms have Impulse Radio wave shapes

They are UWB since their centre frequency is the zero frequency

… a coincidence?


Much of neural computation involves processing these neuronal spike trains

Spikes: Exploring the Neural Code (Computational Neuroscience)

Neuronal pulses

QRS-complex pulses

Speech pulses


Speech waveform

Presence of periodic vs. noise-like portions

Noise-like portions correspond to voiceless sounds: during production vocal folds do not vibrate

Periodic portions correspond to voiced sounds: during production, vocal folds vibrate
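
The contrast between periodic and noise-like portions can be made concrete with a normalised autocorrelation. The sketch below is only an illustration and is not a method taken from the talk: the 8 kHz sampling rate, the 125 Hz test signal and the function name `periodicity` are all assumptions.

```python
import math
import random

def periodicity(signal, min_lag, max_lag):
    """Return the largest normalised autocorrelation over the given lag range."""
    energy = sum(x * x for x in signal)
    if energy == 0.0:
        return 0.0
    best = 0.0
    for lag in range(min_lag, max_lag + 1):
        r = sum(signal[n] * signal[n - lag] for n in range(lag, len(signal)))
        best = max(best, r / energy)
    return best

fs = 8000          # assumed sampling rate in Hz
n_samples = 800    # one 100 ms frame

# Voiced-like frame: 125 Hz fundamental plus two harmonics (period of 64 samples).
voiced = [sum(math.sin(2 * math.pi * 125 * h * n / fs) / h for h in (1, 2, 3))
          for n in range(n_samples)]

# Voiceless-like frame: white Gaussian noise.
rng = random.Random(0)
voiceless = [rng.gauss(0.0, 1.0) for _ in range(n_samples)]

# Search lags corresponding to fundamental frequencies between 400 Hz and 80 Hz.
voiced_score = periodicity(voiced, fs // 400, fs // 80)
voiceless_score = periodicity(voiceless, fs // 400, fs // 80)
print(voiced_score, voiceless_score)
```

A score close to 1 indicates a strongly periodic frame, while a noise-like frame stays close to 0.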


Speech production mechanism


Speech production model for voiced sounds

Derivative of volume velocity U(t) → Linear System (effect of vocal tract) → Speech sound pressure p(t)


Speech production model for voiceless sounds

Linear System

Speech sound pressure p(t)

Turbulence

Constriction

Derivative of volume velocity U(t)


Spectrum of a voiced sound

By courtesy of Hari Arsikere, Speech Processing and Auditory Perception Laboratory, UCLA, USA (Director: Prof. Abeer Alwan)


Spectrum of a voiceless sound

By courtesy of Hari Arsikere, Speech Processing and Auditory Perception Laboratory, UCLA, USA (Director: Prof. Abeer Alwan)


The model in the VOice CODER (VOCODER)

Source

Pitch period F0

Noise

Voiced/voiceless switch

Gain

Vocal tract

Based on analog vocoder, Homer W. Dudley, patent 1939
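
The blocks listed on this slide (source, voiced/voiceless switch, gain, vocal tract) can be sketched in a few lines. This is a minimal illustration under stated assumptions, not Dudley's design: the vocal tract is reduced to a single hypothetical two-pole resonator at 500 Hz, and the function names and the 8 kHz sampling rate are invented for the example.

```python
import math
import random

def pulse_train(n_samples, period):
    """Unit impulses every `period` samples: an idealised voiced source."""
    return [1.0 if n % period == 0 else 0.0 for n in range(n_samples)]

def white_noise(n_samples, seed=0):
    """White Gaussian noise: an idealised voiceless source."""
    rng = random.Random(seed)
    return [rng.gauss(0.0, 1.0) for _ in range(n_samples)]

def resonator(x, freq_hz, bandwidth_hz, fs):
    """Two-pole resonator standing in for the vocal tract (one formant only)."""
    r = math.exp(-math.pi * bandwidth_hz / fs)
    a1 = -2.0 * r * math.cos(2.0 * math.pi * freq_hz / fs)
    a2 = r * r
    y = []
    for n, xn in enumerate(x):
        y1 = y[n - 1] if n >= 1 else 0.0
        y2 = y[n - 2] if n >= 2 else 0.0
        y.append(xn - a1 * y1 - a2 * y2)
    return y

def vocoder_frame(voiced, gain, f0_hz, fs=8000, n_samples=400):
    """Voiced/voiceless switch, then gain, then vocal-tract filter."""
    if voiced:
        source = pulse_train(n_samples, round(fs / f0_hz))
    else:
        source = white_noise(n_samples)   # f0_hz is ignored for voiceless frames
    excitation = [gain * s for s in source]
    return resonator(excitation, freq_hz=500.0, bandwidth_hz=80.0, fs=fs)

voiced_frame = vocoder_frame(voiced=True, gain=2.0, f0_hz=100.0)
voiceless_frame = vocoder_frame(voiced=False, gain=0.1, f0_hz=100.0)
print(len(voiced_frame), len(voiceless_frame))
```

The hard switch between the two sources is the part of this sketch that matters most: each frame is either fully periodic or fully noise-like.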


VOCODER strongest limitation

The model is way too simplistic in the case of sounds with a mixed voiced-voiceless nature


Mixed-Excited VOCODER

This model is based on linear combination of periodic and noise excitation

Diagram labels: gains Gp and Gn, each applied through a multiplier (x)
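
A linear combination of periodic and noise excitation can be written directly. The sketch below assumes that Gp scales the periodic branch and Gn the noise branch, and that the periodic branch is an ideal unit pulse train; the function name and parameters are hypothetical.

```python
import random

def mixed_excitation(n_samples, period, g_p, g_n, seed=0):
    """Return g_p * (periodic pulse train) + g_n * (white Gaussian noise)."""
    rng = random.Random(seed)
    out = []
    for n in range(n_samples):
        pulse = 1.0 if n % period == 0 else 0.0
        noise = rng.gauss(0.0, 1.0)
        out.append(g_p * pulse + g_n * noise)
    return out

purely_periodic = mixed_excitation(10, 5, g_p=1.0, g_n=0.0)
mixed = mixed_excitation(10, 5, g_p=1.0, g_n=0.2)
print(purely_periodic)
print(mixed)
```

Setting one gain to zero recovers the hard voiced/voiceless switch as a special case.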


CELP VOCODER

The best multi-pulse is selected from a set stored in a codebook

Based on multi-pulse model presented by Atal and Remde, ICASSP, 1982

Diagram labels: multi-pulse excitation, multipliers (x)

Used in GSM, UMTS and many others

But why “best” is “best” still remains to be understood
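
Selecting the "best" codebook entry can be sketched as an analysis-by-synthesis search. The example assumes that "best" means minimum squared error between the target and the synthesised output, with no perceptual weighting, which is a simplification compared with deployed CELP coders. The filter coefficients and the three-entry codebook are hypothetical.

```python
def synthesize(excitation, a1, a2):
    """Pass an excitation through a fixed two-pole synthesis filter."""
    y = []
    for n, xn in enumerate(excitation):
        y1 = y[n - 1] if n >= 1 else 0.0
        y2 = y[n - 2] if n >= 2 else 0.0
        y.append(xn - a1 * y1 - a2 * y2)
    return y

def best_codebook_entry(target, codebook, a1, a2):
    """Return (index, gain, error) of the entry with the smallest squared error."""
    best = (None, 0.0, float("inf"))
    for index, code in enumerate(codebook):
        synth = synthesize(code, a1, a2)
        energy = sum(s * s for s in synth)
        if energy == 0.0:
            continue
        # Optimal gain for this entry in the least-squares sense.
        gain = sum(t * s for t, s in zip(target, synth)) / energy
        error = sum((t - gain * s) ** 2 for t, s in zip(target, synth))
        if error < best[2]:
            best = (index, gain, error)
    return best

a1, a2 = -1.2, 0.72   # hypothetical stable synthesis filter
codebook = [
    [1.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.0, 0.0, -1.0, 0.0, 0.0],
    [1.0, 0.0, 0.0, 0.0, 1.0, 0.0, 0.0, 0.0],
]
# Build a target that entry 1 can reproduce exactly with a gain of 0.5.
target = [0.5 * s for s in synthesize(codebook[1], a1, a2)]
index, gain, error = best_codebook_entry(target, codebook, a1, a2)
print(index, gain, error)
```

The error criterion is the design choice that defines "best" here; a different criterion would in general select a different entry.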


Spectrum of a mixed sound

Tilt at high frequencies

Periodicity loss at low frequencies

Aspirated sound [hiy]


Vocal folds

Two-mass model of vocal folds

From Stevens, Acoustic Phonetics, The MIT Press, 2000

Lateral sections of vibrating vocal folds


The LF model of the glottal source

Introduced by G. Fant et al. in 1985; refined in G. Fant, "The LF-model revisited. Transformations and frequency domain analysis", STL-QPSR, vol. 36, pp. 119-156, 1995

Derivative of the glottal airflow

Resembles the output of a UWB transmitter antenna: the first derivative of a bell-shaped pulse
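
The shape referred to here can be illustrated with a Gaussian and its analytic first derivative. This is not the LF model itself: the Gaussian is only one assumed choice of bell-shaped pulse, and the width `sigma` and the time grid are hypothetical values.

```python
import math

def gaussian_pulse(t, sigma):
    """Bell-shaped pulse exp(-t^2 / (2 sigma^2))."""
    return math.exp(-t * t / (2.0 * sigma * sigma))

def gaussian_derivative(t, sigma):
    """Analytic first derivative of gaussian_pulse with respect to t."""
    return -t / (sigma * sigma) * gaussian_pulse(t, sigma)

sigma = 0.5e-9                                    # hypothetical pulse width in seconds
times = [k * 0.05e-9 for k in range(-40, 41)]     # grid from -2 ns to +2 ns
waveform = [gaussian_derivative(t, sigma) for t in times]

# The derivative is odd: a positive lobe before t = 0 and a negative lobe after it.
peak_index = waveform.index(max(waveform))
trough_index = waveform.index(min(waveform))
print(times[peak_index], times[trough_index])
```

The two lobes peak at t = -sigma and t = +sigma, which gives the single-cycle shape with no DC component.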


Excitation signal at the glottis: idealistic


Excitation signal at the glottis: realistic


Impulse Radio UWB: Pulse Position Modulation

From M.-G. Di Benedetto and G. Giancola, Understanding Ultra Wide Band Radio Fundamentals, Prentice Hall, 2004

Samples m(kTs) of an analog wave m(t), taken every Ts, determine pulse position
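
As a rough sketch of this idea, each pulse can be placed at its nominal instant k*Ts plus a shift driven by the sample m(kTs). The linear mapping from sample value to shift, the parameter `max_shift` and the sinusoidal test wave are assumptions made for the example; the slide does not specify the mapping.

```python
import math

def ppm_pulse_times(samples, ts, max_shift):
    """Pulse k is emitted at k*ts plus a shift proportional to sample k.

    Assumes samples are normalised to [-1, 1] and max_shift < ts / 2,
    so that consecutive pulses never swap order.
    """
    return [k * ts + m * max_shift for k, m in enumerate(samples)]

ts = 1.0e-3                      # hypothetical sampling period in seconds
f_mod = 50.0                     # hypothetical frequency of the analog wave m(t)
samples = [math.sin(2 * math.pi * f_mod * k * ts) for k in range(20)]
times = ppm_pulse_times(samples, ts, max_shift=0.2e-3)
print(times[:5])
```

The information carried by m(t) is held entirely in the deviations of the pulse instants from the regular grid k*Ts.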


Impulse Radio UWB: Pulse Position Modulation

From M.-G. Di Benedetto and G. Giancola, Understanding Ultra Wide Band Radio Fundamentals, Prentice Hall, 2004

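$$
P_{x_{PPM}}(f) = \frac{|P(f)|^{2}}{T_s}\left[\,1 - |W(f)|^{2} + \frac{|W(f)|^{2}}{T_s}\sum_{n=-\infty}^{+\infty}\delta\!\left(f - \frac{n}{T_s}\right)\right]
$$

$$
W(f) = \int_{-\infty}^{+\infty} w(s)\,e^{-j2\pi f s}\,ds = \left\langle e^{-j2\pi f s}\right\rangle = C(-2\pi f)
$$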

where W(f) is the Fourier transform of the probability density w and coincides with the characteristic function C of w evaluated at -2πf

w(s) is the probability density function of the samples m(kTs) of a stationary continuous process m(t)
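
The role of W(f) can be explored numerically. The sketch below assumes a density w that is uniform on [-a, a], for which W(f) = sin(2πfa)/(2πfa), and takes P(f) to be the Fourier transform of the basic pulse, replaced here by a flat placeholder. It evaluates the continuous term (|P(f)|²/Ts)(1 - |W(f)|²) and the weight |P|²|W|²/Ts² of the spectral line at n/Ts; all function names are hypothetical.

```python
import math

def w_uniform(f, a):
    """W(f) for a density uniform on [-a, a]: sin(2*pi*f*a) / (2*pi*f*a)."""
    x = 2.0 * math.pi * f * a
    return 1.0 if x == 0.0 else math.sin(x) / x

def continuous_psd(f, ts, a, pulse_spectrum):
    """Continuous part of the PPM spectrum: |P(f)|^2 / Ts * (1 - |W(f)|^2)."""
    p2 = abs(pulse_spectrum(f)) ** 2
    return p2 / ts * (1.0 - w_uniform(f, a) ** 2)

def line_weight(n, ts, a, pulse_spectrum):
    """Weight of the spectral line at f = n / Ts: |P|^2 |W|^2 / Ts^2."""
    f = n / ts
    return abs(pulse_spectrum(f)) ** 2 * w_uniform(f, a) ** 2 / ts ** 2

def flat_pulse(f):
    """Placeholder pulse spectrum P(f) = 1, used only to isolate the effect of W."""
    return 1.0

ts = 1.0
for a in (0.0, 0.1, 0.5):
    print(a, line_weight(1, ts, a, flat_pulse), continuous_psd(1.0, ts, a, flat_pulse))
```

With a = 0 the pulses are perfectly regular and only spectral lines remain; as a grows, power moves from the lines into the continuous part.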


Impulse Radio UWB: Pulse Position Modulation


Experimental evidence

Synthesis of a vowel produced by one male and one female speaker

regular pulses → H(z) → Synthetic vowel

irregular pulses → H(z) → Synthetic vowel

Increasing % of pulse jitter
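
The experiment can be mimicked with a jittered pulse train driving an all-pole filter. Everything specific in this sketch is an assumption: jitter is modelled as a uniform shift within plus or minus the stated percentage of the pitch period, H(z) is a cascade of three two-pole resonators with hypothetical formant values, and the sampling rate is 8 kHz. The talk does not state how its jitter or its H(z) were defined.

```python
import math
import random

def jittered_pulse_train(n_samples, period, jitter_percent, seed=0):
    """Unit pulses nominally every `period` samples, each shifted at random.

    Assumption: the shift is uniform within +/- jitter_percent of the period.
    """
    rng = random.Random(seed)
    max_shift = period * jitter_percent / 100.0
    x = [0.0] * n_samples
    for k in range(n_samples // period + 1):
        position = round(k * period + rng.uniform(-max_shift, max_shift))
        if 0 <= position < n_samples:
            x[position] += 1.0
    return x

def resonator(x, freq_hz, bandwidth_hz, fs):
    """Two-pole resonator modelling one formant."""
    r = math.exp(-math.pi * bandwidth_hz / fs)
    a1 = -2.0 * r * math.cos(2.0 * math.pi * freq_hz / fs)
    a2 = r * r
    y = []
    for n, xn in enumerate(x):
        y1 = y[n - 1] if n >= 1 else 0.0
        y2 = y[n - 2] if n >= 2 else 0.0
        y.append(xn - a1 * y1 - a2 * y2)
    return y

def synthetic_vowel(excitation, formants, fs):
    """Apply H(z) as a cascade of resonators, one per (frequency, bandwidth) pair."""
    y = excitation
    for freq_hz, bandwidth_hz in formants:
        y = resonator(y, freq_hz, bandwidth_hz, fs)
    return y

fs = 8000
formants = [(500.0, 80.0), (1500.0, 100.0), (2500.0, 120.0)]   # hypothetical H(z)
regular = synthetic_vowel(jittered_pulse_train(400, 80, 0.0), formants, fs)
irregular = synthetic_vowel(jittered_pulse_train(400, 80, 30.0), formants, fs)
print(len(regular), len(irregular))
```

Only the excitation differs between the two outputs, so any audible or spectral difference is attributable to the pulse jitter alone.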


Experimental results

Synthesis of vowel [e] male speaker

Synthetic vowel no jitter

Synthetic vowel 30% jitter

Synthetic vowel 5% jitter

Synthetic vowel 10% jitter


Experimental results

Synthesis of vowel [a] female speaker

Synthetic vowel no jitter

Synthetic vowel 30% jitter

Synthetic vowel 5% jitter

Synthetic vowel 10% jitter


Conclusion

• Example of how UWB theory can help us understand the structure of impulsive physiologically produced signals

• Interesting insights can be derived from what we know about properties of non-linear modulation in UWB

• Modeling production mechanisms in order to understand basic properties of physiologically produced signals


Challenging workframe

COST Action IC0902: Cognitive Radio and Networking for Cooperative Coexistence of Heterogeneous Wireless Networks

Chair: Maria-Gabriella Di Benedetto

http://newyork.ing.uniroma1.it/IC0902


GTTI Meeting 2010, June 23, 2010, Brescia

Economic dimension

10 COST countries

3 non-COST countries

Estimated economic dimension: 44 Million € for the total duration of the Action

20 COST countries

5 non-COST countries

Cyprus, Czech Rep., Ireland, Israel, Latvia, Norway, Romania, Slovenia, Sweden, Turkey

Participation of over 30 countries


Challenging workframe

COST Action IC0902: Cognitive Radio and Networking for Cooperative Coexistence of Heterogeneous Wireless Networks

Chair: Maria-Gabriella Di Benedetto

http://newyork.ing.uniroma1.it/IC0902

EU FP7 Network of Excellence ACROPOLIS

Advanced coexistence teChnologies for Radio OPtimisatiOn in Licensed and unlicensed Spectrum

October 1, 2010