107
Neural representation of musical pitch www.cariani.com Wednesday, February 11, 2009 Figure by MIT OpenCourseWare. HST.725 Music Perception and Cognition, Spring 2009 Harvard-MIT Division of Health Sciences and Technology Course Director: Dr. Peter Cariani

Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Neural representation of musical pitch

www.cariani.com

Wednesday, February 11, 2009

Figure by MIT OpenCourseWare.

HST.725 Music Perception and Cognition, Spring 2009Harvard-MIT Division of Health Sciences and TechnologyCourse Director: Dr. Peter Cariani

Page 2: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Pitch: the basis of musical tonality • Operational definition of pitch • Pitch of pure tones • Pitch of harmonic complexes at the fundamental • Pitch of the missing fundamental • Pitch of unresolved harmonics • Repetition pitch • Pitch salience • Relative vs. absolute pitch • Pitch circularity • Pitch: how many dimensions do we need? • Models of pitch: Place vs. temporal theories • Envelope vs. fine structure (Schouten & de Boer)

– Residue theories (incomplete filtering of unresolved harmonics) – Temporal autocorrelation theories (Licklider)

• Analytical (Helmholtz) vs. Gestalt (Stumpf) perspectives – Spectral pattern analysis/completion vs. dominant periodicity

Wednesday, February 11, 2009

Page 3: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Dimensions of auditory objects Dimensions of event perception

Auditory qualities and their organization Unitary events & their organization

Objects: Quasi-stationary Events: abrupt perceptualassemblages of qualities discontinuities

Timbre

Loudness

Location Spatial Dimensions

Pitch

TEMPORAL EVENT

STRUCTURE

Timing & order (metric, sequence)

Duration

FUSION/SEPARATION FUSION/SEPARATIONCommon onset & harmonic structure => fusion Common onset, offset => fusion

Different F0s, locations, onset => separation Diff. meters, pitch, timbre => separationPOLYPHONY STREAMS, POLYRHYTHMS

Wednesday, February 11, 2009

Page 4: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

A few words about Loudness

• Loudness is the perceptual attribute that covaries with the intensityof sounds (loudness is the subjective attribute, intensity is thephysical, acoustical property)

• We mentioned that the auditory system has a huge dynamic range, over a factor of 100,000 between the sound pressure level of thesoftest and the loudest sounds.

• Loudness is important in music for several reasons – Listening level (louder music is more salient, captures attention) – Onsets and accents (loudness contrast accents notes) – Dynamics (changes in loudness communicate tension, relaxation) – Safety issues (listening to music at high levels (>100 dB SPL) for

prolonged periods of time will damage your ears and impair yourability to hear music

Wednesday, February 11, 2009

Page 5: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Typical sound levels in music On origins of music dynamics notationhttp://www.wikipedia.org/wiki/Pianissimo

Text removed due to copyright restrictions. See the Wikipedia article.

• Pain > 130 dB SPL • Loud rock concert 120 dB SPL • Loud disco 110 dB SPL • fff 100 dB

SPL

• f (forte, strong) 80 dB SPL

• p (piano, soft) 60 dB SPL

• ppp 40 dB SPL

• Lower limit • Theshold of hearing 0 dB SPL

Tuesday, February 10, 2009

Page 6: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Typical sound pressure levels in everyday life

Disco

Courtsey of WorkSafe, Department of Consumer and Employment Protection, Western Australia (http://www.safetyline.wa.gov.au).

http://www.safetyline.wa.gov.au/institute/level2/course18/lecture54/l54_03.asp Wednesday, February 11, 2009

Page 7: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

2 4 88 102 2 4 8 103 2 4 8 104-45 -40 -35 -30 -25 -20 -15 -10 -5 0 5

Rel

ativ

e R

espo

nse

(dB

) C

B

B, C

A

A

Frequency (Hz)

Sound level meters and frequency weightings

A: based on human equal loudness contours @ ~40 dB SLFletcher-Munson curves (recently revised)

C: flat-weighting

Graph by MIT OpenCourseWare. SPL meter photo courtesy of EpicFireworks on Flickr.

Wednesday, February 11, 2009

Page 8: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Dynamic range of some musical instruments

Images removed due to copyright restrictions.

Graphs of relative intensity vs. pitch for different instruments: violin, double bass, flute, B-flat clarinet, trumpet, french horn. Figure 8.5 in Pierce, J. R. The Science of Musical Sound. Revised ed. New York, NY: W.H. Freeman & Co., 1992. ISBN: 9780716760054.

Noise floor: ~ 45 dB SPL Conversation: 60 dB SPL Symphony: 80-90 dB SPLDisco: 100 dB SPL

Wednesday, February 11, 2009

Page 9: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Hearing loss with age (overexposure to loud sounds accelerates this process)

Wednesday, February 11, 2009

Figure by MIT OpenCourseWare.

Page 10: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Steps to prevent hearing damage

Use earplugs (reduce levels by 20-30 dB) Photo courtesy of KarenD on Flickr.

Take measures in recurring situations where you experience ringing in your ears (concerts, discos) Impulsive loud sounds are worst (gunshots, hammering)

Be wary of cranking up the level in cars, especially when thewindows are down (if it sounds terribly loud when youʼre stopped at a light, this should tell you something)

With personal sound players (MP3s, iPods, walkman),always set the listening level in quietdonʼt crank up the level in noisy situationsuse the volume limiter feature (set this in quiet)if you listen in noisy situations (mowing the lawn), then by allmeans use noise-cancellation headphones

Wednesday, February 11, 2009

Page 11: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Pitch in music

• Tonal music is based in large part on pitch relations

• Sequences of pitches constitute melodies • Relations between combinations of pitches

constitute harmonies • Sets of pitches make up musical scales, which

are the perceptual atoms of musical tonality

• Musical pitch is relative pitch (transpositionalinvariance)

• We will discuss absolute pitch later in the course

Wednesday, February 11, 2009

Page 12: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Operational definition of “pitch”

• Pitch is that auditory quality that varies with the periodicity and/orfrequency of sounds.

• (i.e. not loudness, duration, location, or timbre)

• Operationally, pitch is defined as the frequency of a pure tone to which a sound is matched.

• Since pure tone pitch changes very slightly (0-4%) with large changesin sound pressure level (40 dB), the level of the reference tone alsohas to be specified.

• For musical sounds (complex tones), this much celebrateddependence of pitch on level is quite minimal (< 1% over 40 dB). Themore harmonics, the smaller the effect.

Wednesday, February 11, 2009

Page 13: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Pitch metamery(perceptual

equivalence)

Sounds with different frequency spectra can

produce the same pitch

Figure by MIT OpenCourseWare.

Wednesday, February 11, 2009

Page 14: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Range of pitches of pure & complex tones • Pure tone pitches

– Range of hearing (30-20,000 Hz)

– Range in tonal music (100-4000 Hz)

– Pitches of individual partials in a complex, "analytical" pitch

• Most (tonal) musical instruments produce harmonic complexes that evoke pitches at their fundamentals (F0ʼs) – Called virtual pitch, periodicity pitch, low pitch

– Range of F0ʼs in tonal music (30-4000 Hz)

– Range of missing fundamental (30-1200 Hz)

Wednesday, February 11, 2009

Page 15: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Music spectrograms

Pitch is not simply frequency

Music pitches are not pure tones --They are mostly harmonic complex tones

The pitch that is heard for a harmonic complex tone corresponds to the fundamental frequency of the tone (with very few exceptions)

Wednesday, February 11, 2009

Page 16: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

“Virtual” pitch: F0-pitch as pattern completion

Figure by MIT OpenCourseWare.

“Missing fundamental“ analogy to illusory contour

Wednesday, February 11, 2009

Page 17: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Fundamentals and harmonics

• Periodic sounds (30-20kHz)produce pitch sensations.

• Periodic sounds consist of repeating time patterns.

• The fundamental period (F0) is theduration of the repeated pattern.

• The fundamental frequency is therepetition frequency of the pattern.

• In the Fourier domain, thefrequency components of a periodic sound are all members ofa harmonic series (n = 1*F0, 2*F0,3*F0...).

• The fundamental frequency istherefore the greatest commondivisor of all of the componentfrequencies.

• The fundamental is also therefore a subharmonic of all componentfrequencies.

Wednesday, February 11, 2009 Figure by MIT OpenCourseWare.

Page 18: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

25

25

Emergent pitch

Missing Line spectra Autocorrelation (positive part)F0

600

10

5

0

0 200 400 600 800 1000 1200 1400 1600 1800 2000 0 5 10 15 20

Pure tone200 Hz

0 200 400 600 800 1000 1200 1400 1600 1800 2000

0 5 10 15 20

Wednesday, February 11, 2009

Page 19: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Harmonic series

A harmonic series conists of integer multiples of a fundamental frequency, e.g. if the fundamental is 100 Hz, then the harmonic series is: 100, 200,300, 400, 500, 600 Hz, .... etc.

The 100 Hz fundamental is the first harmonic, 200 Hz is the secondharmonic. The fundamental is often denoted by F0.

The fundamental frequency is therefore the greatest common divisor of allthe frequencies of the partials.

Harmonics above the fundamental constitute the overtone series.

Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25, 20, 16.6 Hz etc. Subharmonics are also called undertones.

The fundamental period is 1/F0, e.g. for F0=100 Hz, it is 1/100 sec or 10 Wednesday, February 11, 2009

Page 20: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Auditory system:Frequency analyzer vs. Periodicity analyzer

Conceptual models of resonance:

Helmholz resonator band-pass filter -- one frequencyfrequency decompositionsimple oscillator models

Stringmany harmonicscomb filter

Figure by MIT OpenCourseWare.

Complex oscillator - delay loopmatched filter, complex pattern generator

Figure by MIT OpenCourseWare.

Wednesday, February 11, 2009

Page 21: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Helmholtz resonator http://www.phys.unsw.edu.au/jw/Helmholtz.html

Courtesy of Joe Wolfe. Used with permission. http://www.phys.unsw.edu.au/music

Courtesy of Joe Wolfe. Used with permissions. http://www.phys.unsw.edu.au/music

Wednesday, February 11, 2009

Page 22: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Sirens

A simple siren is produced by forcing compressed air through equally spaced holes on a rotating disk. This produces a periodic vibration whose frequency equals De la Tour's sirenthe rate of holes passing by the air nozzle.

Wednesday, February 11, 2009

Page 23: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Early sound analysis of vowels

In this 19th century apparatus developed by Koenig, waveforms were visualized by viewing a flame reflected on a rotating mirrored drum. Vowel sounds resulted in the same flame pattern regardless of their pitch level.

Wednesday, February 11, 2009

Page 24: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

.

Stimulus

array of cochlear band-pass filters auditorynerve fiber

0 3

dB

Frequency (kHz)

50

0

F0= 80 Hz

tuning curves discharge rates interspike intervals

Power spectrumrepresentation

Frequency domain Population rate-

place profile

frequencyoptimal (linear scale)match F0 = 200 Hz F0 = 160 Hz

F0 = 100 Hz harmonic templates

Autocorrelation representationTime domain

Populationinterspike interval

distribution

0

1/F01/F1

5 10 15 Interval (ms)

# inte

rvals

Pitch → best fitting template

Wednesday, February 11, 2009

Page 25: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Correlograms

Wednesday, February 11, 2009

Images removed due to copyright restrictions. See Figures 6.16A-D and 6.17 in Lyon, R. and S. Shamma. "Auditory Representations of Timbre and Pitch."In Auditory Computation. Edited by R. R. Fay. New York, NY: Springer, 1996.

Page 26: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Licklider virtual pitch masking demonstration (iTunes/Spectrograph)

Wednesday, February 11, 2009

Page 27: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Periodic sounds produce distinctpitches Strong

pitches

Many differentsounds producethe same pitches

Strong• Pure tones • Harmonic complexes• Iterated noise

Weaker • High harmonics• Narrowband noise

Weaker low

Very weak pitches• AM noise • Repeated noise Wednesday, February 11, 2009

Figure by MIT OpenCourseWare.

Page 28: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

As harmonic numbers increase, the missingfundamental gets weaker.

Wednesday, February 11, 2009

Page 29: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

• Highly precise percepts Pitch basics – Musical half step: 6% change F0 – Minimum JND's: 0.2% at 1 kHz (20 usec time difference, comparable to ITD jnd)

• Highly robust percepts – Robust quality Salience is maintained at high stimulus intensities – Level invariant (pitch shifts < few % over 40 dB range) – Phase invariant (largely independent of phase spectrum, f < 2 kHz)

• Strong perceptual equivalence classes – Octave similarities are universally shared

– Musical tonality (octaves, intervals, melodies) 30 Hz - 4 kHz

• Perceptual organization (“scene analysis”) – Fusion: Common F0 is a powerful factor for grouping of frequency components

• Two mechanisms? Temporal (interval-based) & place (rate-based)

– Temporal: predominates for periodicities < 4 kH (level-independent, tonal) – Place: predominates for frequencies > 4 kHz(level-dependent, atonal)

Wednesday, February 11, 2009

Page 30: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Pure tone pitch discriminationbecomes markedly worseabove 2 kHz

Weber fractions for frequency (∆f/f) increase1-2 orders of magnitudebetween 2 kHz and 10 kHz

Figure by MIT OpenCourseWare.

Page 31: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Wednesday, February 11, 2009

JND's

Figure by MIT OpenCourseWare.

Page 32: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Pure tone pitchdiscrimination improves

at longer tone durations

and

at highersound pressurelevels

Wednesday, February 11, 2009

Figure by MIT OpenCourseWare.

Page 33: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

"Pitchedness" as a function of sound duration

Wednesday, February 11, 2009

Graph removed due to copyright restrictions.Figure 36, comparing "Tone pitch" and "click pitch" response. In Licklider, J. C. R. "Basic Correlates of theAuditory Stimulus." Handbook of experimental psychology. Edited by S. S. Stevens. Oxford, UK: Wiley,1951. pp. 985-1039.

Page 34: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

8k6Frequency ranges of (tonal) musical instruments 54

> 6 kHz 2.5-4 kHz 32

27 Hz 110 262 440 880 4 kHzHz Hz Hz Hz

Wednesday, February 11, 2009

Page 35: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

The neural coding problem in audition:

How does the brain represent andprocess acoustic patterns, such that wehear what we hear?

In particular, how does it represent periodic sounds, such that we hearpitches at the fundamentals of musicalsounds?

Wednesday, February 11, 2009

Page 36: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Where everything takes place:from cochlea to cortex, and beyond

10,000k

500k

30k

3k

Primaryauditory cortex

(Auditory forebrain)

Auditory thalamus

Inferior colliculus (Auditory midbrain)

Lateral lemniscus

Auditory brainstem

Auditory nerve (VIII)

Cochlea

Wednesday, February 11, 2009 Figure by MIT OpenCourseWare.

Page 37: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Hardware

Computations

Decisions

Neural architectures

Neural codes

Functions

Information-processingoperations

Sensoryencodings

Motor commands

External world

Receptors

Effectors

Reverse-engineering

the brain

Signals

Wednesday, February 11, 2009

Page 38: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Wednesday, February 11, 2009

Figure by MIT OpenCourseWare.

Page 39: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Ear and cochlea

Figure by MIT OpenCourseWare.

Wednesday, February 11, 2009

Tympanic Membrane

Stapes on Oval Window

Cochlear Base

Scala Vestibuli

Basilar Membrane

Relative A

mplitude

Scala Tympani “Unrolled” Cochlea

Cochlear Apex

Helicotrema

Narrow Base of Basilar Membraneis “tuned” for highfrequencies

Wider apex is“tuned” for lowfrequencies

Distance from Stapes (mm)

1600 Hz

800 Hz

400 Hz

200 Hz

100 Hz

50 Hz

25 Hz

0 10 20 30

Traveling waves along the cochlea. A traveling wave is shown at a given instant along the cochlea, which has been uncoiled for clarity. The graphs profile the amplitude of the traveling wave along the basilar membrane for different frequencies, and show that the position where the traveling wave reaches its maximum amplitude varies directly with the frequency of stimulation.(Figures adapted from Dallos, 1992 and von Bekesy, 1960)

Page 40: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

~30,000 fibers (humans)

Wednesday, February 11, 2009

Page 41: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Neural frequency tuningPLACE PRINCIPLE Disconnect between cochlear tuning& pitch discrimination for freqs < 4 khz

CF = characteristic frequency

Wednesday, February 11, 2009

Figure by MIT OpenCourseWare.

Page 42: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

.

Temporal coding in the auditory nerve Cat, 100x @ 60 dB SPL

10

1

Char

acte

ristic

freq

. (kH

z)

0 5010 20 30 40 Peristimulus time (ms)

Wednesday, February 11, 2009

Page 43: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

LowCF

HighCF

F0

F1

F2

F3

Peristimulus time (ms)

Time domain analysis of auditory-nerve fiber firing rates.Hugh Secker-Walker & Campbell Searle, J. Acoust. Soc. 88(3), 1990Neural responses to /da/ @ 69 dB SPL from Miller and Sachs (1983)

High CFs

Low CFs

F1

F2

F3

Peristimulus time (ms)

Wednesday, February 11, 2009

Reprinted with permission, from Secker-Walker HE, Searle CL. 1990. "Time-domain Analysis of Auditory-nerve-fiber Firing Rates." J Acoust Soc Am 88 (3):1427-36. Copyright 1990, Acoustical Society of America.

Page 44: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Phase-locking in auditory nerve fibers

250 Hz tone

See Javel E, McGee JA, Horst W, Farley GR, "Temporal mechanisms in auditory stimulus coding." In: G. M. Edelman, W. E. Gall and W. M. Cowan, ed, Auditory Function: NeurobiologicalBases of Hearing, Wiley: New York 1988; p. 518.

Wednesday, February 11, 2009

Figure by MIT OpenCourseWare.

Page 45: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Figure 4. Responses of two auditory nerve fibers with different CF's to 100 presentationsof a single formant vowel, 60 dB SPL. A-F. Unit 25-19, CF = 950 Hz, near the formantregion. H-K. Unit 25-91, CF = 2.1 kHz, well above the formant region. A. Vowelwaveform. Fundamental period 1/F0 (line) is 12.5 ms, F0 = 80 Hz. B. Dot raster display ofindividual spike arrival times. C. Peristimulus time histogram of spike arrival times. D.Stimulus autocorrelation function. Vertical line indicates the fundamental period, 1/F0 =voice pitch. E. Histogram of first-order interspike intervals (between consecutive spikes).Arrows indicate intervals near the fundamental/pitch period. F. Histogram of all-orderintervals (between both consecutive and nonconsecutive spikes). H-K. Correspondinghistograms for the second fiber.

A D

AuditoryFirst order nerve fiber

B 100 E 25

intervals

responsesto a periodicsound 0 0

C 25 F 30 All order

0 0

0 10 20 30 40 50 0 5 10 15 20 25

H 100 J 14 First order intervals

0

I 30 K 20 All order intervals

0 0 0 10 20 30 40 50 0 5 10 15 20 25

Peristimulus time (ms) Interspike interval (ms)

Wednesday, February 11, 2009

Page 46: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

!

!"#$%&'()$*("#$%+,-&*("*#.$*,/0(#1%2*"$%+$*$"310$*&45/-/&*'$%(10(3(4$&*

Wednesday, February 11, 2009Figure by MIT OpenCourseWare.

Page 47: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Global temporal pitch representation (Collaborator B. Delgutte)

Pitch (frequency) = the predominant interval or interval pattern

Pitch strength (salience) = the relative fraction of

pitch-related intervals inthe whole distribution

Detectability: A pitch can beheard iff its salience

•All-order interspike intervals •Population-wide distribution:

–All auditory nerve fibers –(all CFs, all SRs)

Predictions

Wednesday, February 11, 2009

Page 48: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

. .

Stimulus waveform 1/F0 Stimulus Autocorrelation Pitch =1/F0

Construction of the population-interval

distribution Post-stimulus time histograms All-order interval histograms

of the auditory nerve

(Cariani & Delgutte,J. Neurophysiol.

1996) Ch

arac

teris

tic fr

eque

ncy (

kHz)

5

2

1

.5

.2

Population-PST histogram Population-interval histogram 250 1/F0 500

# spik

es

# inte

rvals

240 250 260 270 280 290 0 5 10 15 Peristimulus time (ms) Interval (ms)

Wednesday, February 11, 2009

Source: Cariani, P. A., and B. Delgutte."Neural Correlates of the Pitch of Complex Tones. I. Pitch and Pitch Salience."J Neurophysiol 76 (1996): 1698-1716.

[0022-3077/96].Courtesy of the American PhysiologicalAssociation. Used with permission.

Page 49: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

.

Stimulus Autocorrelation Pitch =1/F0

Cha

ract

eris

tic fr

eque

ncy

(kH

z)

All-order interval histograms

5

2

1

.5

.2

Population-interval histogram500

Autocorrelation functions

Fundamental Corr(τ) =Σ S(t) S(t- τ)period t 1/F0

Shift MultiplySum the productsfor each delay τ to computeautocorrelation function

time lagτ

Autocorrelations Histograms of=of spike trains all-order intervals

00000010000000001000000000000000000001000000000010000000 00000010000000001000000000000000000001000000000010000000

# in

terv

als

0 5 10 15

Interval (ms) Source: Cariani, P. A., and B. Delgutte. "Neural Correlates of the Pitch of Complex Tones. I. Pitch and Pitch Salience." J Neurophysiol 76 (1996): 1698-1716. [0022-3077/96]. Courtesy of the American Physiological Association. Used with permission.

Wednesday, February 11, 2009

Page 50: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Many differentsounds produce Strongthe same pitches pitches

Strong• Pure tones • Harmonic complexes• Iterated noise

Weaker • High harmonics• Narrowband noise

Very weak• AM noise • Repeated noise

Weaker low

pitches

Wednesday, February 11, 2009

Figure by MIT OpenCourseWare.

Page 51: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

125 Hz 250 Hz 500 Hz

Manydifferent sounds produce

the same pitch

pitchmetamery

Fastl, H. & Stoll, G. Scaling of pitch strength,Hearing Research1(1979): 293-301

Wednesday, February 11, 2009 Courtesy Elsevier, Inc., http://www.sciencedirect.com. Used with permission.

Page 52: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Use the structure of perception to find neural codes:1. Use stimuli that produce equivalent percepts2. Look for commonalities in neural response3. Eliminate those aspects that are not invariant

Metameric Neural Neural codes, representations:stimuli response those aspects of the

(same percept, pattern neural responsethat play a functional roledifferent power spectra) in subservingthe perceptual distinctionStimulus A

(AM tone, fm = 200 Hz)

CandidateStimulus B "neural codes"(Click train, F0 = 200 Hz) or representations

Common aspectsIntensity of neural response

that covarywith the percept

of interestOther parametersfor which percept is

invariant Location

Duration

Wednesday, February 11, 2009

Page 53: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Pitch equivalence classes(keep the percept constant, identify neural response invariances)

Six stimuli that produce a low pitch at 160 Hz Populationinterval

Waveform Power spectrum Autocorrelation distribution

Interspikeinterval (ms)

150

Pure tone 160 Hz

AM tone Fm:160 Hz Fc:640 Hz

Harms 6-12

AM tone

Click train

AM noise

F0 :160 Hz

Fm :160 Hz Fc:6400 Hz

Fm :160 Hz

Pitch frequency Pitch period

F0 : 160 Hz

Lag (ms) Frequency (Hz) TIme (ms) 150

WEAK PITCHES

STRONG PITCHES

Wednesday, February 11, 2009

Page 54: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

van Norden, 1981; after Ritsma (1962)

Wednesday, February 11, 2009

Figure by MIT OpenCourseWare.

Page 55: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Level-invariance Pitch equivalence

Wednesday, February 11, 2009 Figure by MIT OpenCourseWare.

Page 56: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

.A correlational representation for loudness? Correlation between

Stimulus autocorrelation population-intervaldistributions and stimulus autocorrelation serves as an index of the amount of common, stimulus-driven time structure in the auditory nerve array.

Population-interval distributions As the sound pressure150 level increases, a

40 dB SPL progressively greater

r = 0.62 fraction of the activity 31 fibers in the array is stimulus

locked and mutuallycorrelated.

1000 60 dB SPL This representation

r = 0.70 makes use of the 61 fibers dynamic range of the

entire auditory nerve 400 80 dB SPL

array, and leads to a theory of partialloudnesses of multiple

r = 0.77 auditory objects in a17 fibers very straightforward0 5 10 15 20 25 way.

# in

terv

als

All-order interval (msec)Wednesday, February 11, 2009

Page 57: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Wednesday, February 11, 2009

Source: Cariani, P. A., and B. Delgutte."Neural Correlates of the Pitch of Complex Tones.I. Pitch and Pitch Salience." J Neurophysiol 76(1996): 1698-1716. [0022-3077/96].Courtesy of the American PhysiologicalAssociation. Used with permission.

Page 58: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

0.015

0 5 100

2

4

10 15

5 10 15

ANF ANF AUTOPopulation-intervalDATA SIMULATION CORRELATIONdistributions and

autocorrelation functions A Pure Tone F=160 Hz

4

3 4

2Pure Tone 2

1

0160 Hz 0 5 10 15 0 5

B AM Tone Fc=640 Hz, Fm=160 Hz

4 4 3.5

AM Tone 3 3 62.5

1.5Fm =160 Hz 1

2

1

2

2

4

0 5 10 15 00 0.005 0.01 00

Fc = 640 Hz 0.5

60

C Click train F0= 160 Hz

4 54.54

Click Train 3 3.53

2 2.51.5F0 = 160 Hz 12

1 0.5

0 5 10 15 00 0.005 0.01 0.015

D AM noise Fm= 160 Hz

4

AM Broadband Noise 3

Fm = 160 Hz 2

1

0 5 10 15 0 5 10 15 0 5 10 15Interval (ms) Interval (ms) Interval (ms)

Wednesday, February 11, 2009

Page 59: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

.

.

.

Inte

rval

(ms)

The running population-interval distribution

Population autocorrelogram 79 AN fibers,~100,000 spikesA B15

1/F0

10

0 100 200 300 400 500 Peristimulus time (ms)

Population intervalhistograms

(cross sections)

1000 A

# in

terv

als

0 5 10 15

Interval (ms)5

1000

0 155 10# in

terv

als B

Interval (ms)

Wednesday, February 11, 2009

Page 60: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Pitch shift of inharmonic complex tones

Fm = 125 Hz Stimulus waveform Stimulus autocorrelation Population intervalFc = 750 Hz 1/Fm 1/Fm

distributions

800 1/Fm

n=6

0 1 220 240 260

# inte

rvals

800

n = 5.86

0 1 260 280 300 * *

# inte

rvals

800

n = 5.5

# inte

rvals

* *

0 1 300 320 340

800

n = 5

# inte

rvals

1 340 360 380 0 5 10 15 0 5 10 15Freq (kHz) Peristimulus time (ms) Lag (ms) Interval (ms)

Wednesday, February 11, 2009

0

Page 61: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

.

C Fc=640 Hz, Fm=160-320 Hz Pitch and In

terv

al (m

s)

E G

1/Fm

5 de Boer's rule

Peristimulus time (ms)Wednesday, February 11, 2009

Page 62: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Pitch shift of inharmonic complex tonesPhase-invariant nature of all-order interval code

Wednesday, February 11, 2009

Page 63: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Wednesday, February 11, 2009

Page 64: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

.

Inte

rval

(ms)

Cochlear nucleus IV: Pitch shift Variable-Fc AM tone Fm = 125 Hz Fc = 500-750 Hz Pitch ~ de Boer's rule

Pooled ANF (n=47) Fc Chop-S (PVCN) Unit 35-40 CF: 2.1 kHz

500

de Boer's rule

(pitches)

750 500 (Hz) Thr: 5.3 SR: 17.7

15 15

10

5

0 100 200 300 400 500 0 100 200 300 400 45-17-4 CF: 408 Hz 45-15-8 CF:4417

10 1/Fm

Thr: -18, SR: 39 s/s

10

0 500100 200 300 400 Peristimulus time (ms)

Inte

rval

(ms)

0 500100 200 300 400

30 (AVCN)

Peristimulus time (ms)

de Boer's

(pitches)15 sub-

harmonics of Fc

1/Fc

5

PrimarylikeThr: 21.3 SR: 159 30

Pauser (DCN)

rule 20

Wednesday, February 11, 2009

Page 65: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Dominance region for pitch (harmonics 3-5 or partials 500-1500 Hz)

Wednesday, February 11, 2009

Page 66: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Wednesday, February 11, 2009

Page 67: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Coding of vowel quality (timbre)

Wednesday, February 11, 2009

Page 68: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Inte

rval

s (m

s)In

terv

als

(ms)

Pitch masking Variable-F0 click train in broad-band noise Click train: F0= 160-320 Hz, positive polarity, 80 dB SPLPeak/background ratio: intervals @ 1/F0 ± 150 usec/mean over all intervals

What degree of temporal Informal pitch thresholds, s/n dB: 12 (MT), 13 (PC), 16 (BD)

correlation is necessary forpitch to become audible?

s/n: 8 dB p/b = 1.11 s/n: 32 dB p/b = 2.38

s/n: 20 dB p/b = 1.57 s/n: no noise p/b =3.26

Peristimulus time (ms) Wednesday, February 11, 2009

Page 69: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

.

Vowels

Population-interval coding of timbre (vowel formant structure) Signal autocorrelation [ae]

Voice pitch Population-wide distributions ofshort intervals for 4 vowels

Formant-structure

mag

nitu

de

0

0 5 10 15 Population interval histogram

[α] [∍]

1

# in

terv

als

[u]Voice pitch [æ]Formant-

0 5 0 5

0 5 10 15

1/F1

structure

# in

terv

als

Interval (ms)Interval (ms)

Wednesday, February 11, 2009

Page 70: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

LowCF

HighCF

F0

F1

F2

F3

Peristimulus time (ms)

Time domain analysis of auditory-nerve fiber firing rates.Hugh Secker-Walker & Campbell Searle, J. Acoust. Soc. 88(3), 1990Neural responses to /da/ @ 69 dB SPL from Miller and Sachs (1983)

High CFs

Low CFs

F1

F2

F3

Peristimulus time (ms)

W

Reprinted with permission, from Secker-Walker HE, Searle CL. 1990. "Time-domain Analysis of Auditory-nerve-fiber Firing Rates." J Acoust Soc Am 88 (3):1427-36. Copyright 1990, Acoustical Society of America.ednesday, February 11, 2009

Page 71: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

SummaryPopulation-interval representation of pitch

at the level of the auditory nerve

Pitch of the AM tone Power spectrum Autocorrelation

"missingfundamental"

Pure tone AM tone Click train AM noisePitch Equivalence

40 dB SPL 60 80

Level invariance

n = 6 5.86 5.5 Pitch shift of inharmonic AM tones

Phase invariance AM tone

QFM tone

1/F06-12 1/F03-5

Dominance region F03-5=160 Hz 240 Hz 320 Hz 480 Hz

Strong pitch Weak pitch V. weak pitch

Pitch salience

Wednesday, February 11, 2009

Page 72: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Temporal coding of pitch in the auditory nerve

Pitch = predominant all-order interspike interval

Pitch strength = relative proportion of pitch-related intervals

Timbre (tone quality) = pattern of other intervals

Stimulus autocorrelation ~ population-interval distribution

Readily explains:• Pitch equivalence classes• Invariance w. ∆ sound pressure level • Invariance w. ∆ waveform envelope, phase spectrum• Existence region of musical tonality (octaves, melody)• Pitches of resolved & unresolved harmonics • Pitches of harmonic & inharmonic tone complexes

Wednesday, February 11, 2009

Page 73: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

τe ~ 30 ms

OCTAVESIMILARITYFROMSIMILARITYOFINTERSPIKEINTERVALREPRESENTATIONS

Wednesday, February 11, 2009 This image is from the article Cariani, P. "Temporal Codes, Timing Nets, and Music Perception." Journal of New Music Research 30, no. 2(2001): 107-135. DOI: 10.1076/jnmr.30.2.107.7115. This journal is available online at http://www.ingentaconnect.com/content/routledg/jnmr/

Page 74: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Correlations between population-interval patternsPure tones Harmonics 1-6

R-values (0-1)

Wednesday, February 11, 2009

This image is from the article Cariani, P. "Temporal Codes, Timing Nets, and Music Perception." Journal of New Music Research 30, no. 2 (2001): 107-135.DOI: 10.1076/jnmr.30.2.107.7115. This journal is available online at http://www.ingentaconnect.com/content/routledg/jnmr/

Page 75: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Existence region of musical tonalityis coextensive with spike timing information

Strong phase-locking (temporal information)

Phase-locking as measured bysynchronization indexdeclines dramatically

above ~4 kHz

Musical tonality: octaves, intervals, melodies

temporal representationlevel-invariant, precise

place representationlevel-dependent, coarse

30 100 1k 10k

Frequency (kHz) Wednesday, February 11, 2009

Page 76: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Duplex time-place representations

temporal representationlevel-invariant

• strong (low fc, low n)• weak (high fc, high n; F0 < 100 Hz)

place-based representationlevel-dependent

coarse

30 100 1k 10k

Similarity cf. Terhardt's to spectral and virtual pitchinterval

pattern

Similarity to place pattern

Wednesday, February 11, 2009

Page 77: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Two codes for “pitch

Place code

Pitch = place of excitation

Pitch height

Absolute

Low vs. high

Existence region f: .100-20,000 Hz

Frequency analysisFourier spectrum

Time code

Pitch = dominant periodicity

Musical pitch"Chroma", Tonality

Relational

Musical intervals, tonalityMelodic recognition & transposition

Existence regionF0: 30-4000 Hz

Periodicity analysisAutocorrelation

Wednesday, February 11, 2009

Page 78: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Pitch dimensions: height & chroma

Wednesday, February 11, 2009

Figure by MIT OpenCourseWare.

Page 79: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Pitch height and pitch chroma

Wednesday, February 11, 2009

Images removed due to copyright restrictions.Figures 1, 2, and 7 in Shepard, R. N. "Geometrical Approximations to the Structure of Musical Pitch."Psychological Review 89, no. 4 (1982): 305-322.

Page 80: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Temporal theories - pros & cons

Make use of spike-timing properties ofelements in early processing (to midbrain at least)

Interval-information is precise & robust & level- insensitive No strong neurally-grounded theory of how this

information is used

Unified model: account for pitches of perceptually- resolved & unresolved harmonics in an elegantway (dominant periodicity)Explain well existence region for F0 (albeit with limits on max interval durations)Do explain low pitches of unresolved harmonics

Interval analyzers require precise delays & short coincidence windows Wednesday, February 11, 2009

Page 81: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Spectral pattern models

Mostly conceived within a frequency analysis framework

“Auditory filters” derived from psychophysics, notphysiology

Wednesday, February 11, 2009

Page 82: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Shapes of perceptually-derived "auditory filters" (Moore)Donʼt conflate these with cochlear filters or auditorynerve excitation patterns! Auditory filters are derived from psychophysical data & reflect the response of thewhole auditory system. For lower frequencies and higherlevels AFs have much narrower bandwidths than cochlear resonances or auditory nerve fiber responses.

Wednesday, February 11, 2009 Figures by MIT OpenCourseWare.

Page 83: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

From masking patternsto "auditory filters" as amodel of hearing

Power spectrumFilter metaphor

Notion of one central spectrum that subserves

2.2. Excitation pattern Using the filter shapes and bandwidths derived from masking experiments we canproduce the excitation pattern produced by a sound. The excitation pattern shows how much energy comes through each filter in a bank of auditory filters. It is analogous to the pattern of vibration on the basilarmembrane. For a 1000 Hz pure tone the excitation pattern for a normal and for a SNHL (sensori-neural hearing loss) listener look like this: The excitation pattern to a complex tone is simply the sum of the patterns to the sine waves that make up the complex tone (since the model is a linear one). We can hear out a tone at a particular frequency in a mixture if there is a clear peak in the excitation pattern at thatfrequency. Since people suffering from SNHL have broader auditory filters their excitation patterns do not have such clear peaks. Sounds mask each other more, and so they have difficulty hearing sounds (such asspeech) in noise. --Chris Darwin, U. Sussex, http://www.biols.susx.ac.uk/home/Chris_Darwin/Perception/Lecture_Notes/Hearing3/ hearing3.html

Wednesday, February 11, 2009

Courtesy of Prof. Chris Darwin (Dept. of Psychology at the University of Sussex). Used with permission.

Page 84: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Resolvability of harmonics (Plomp, 1976)

Fundamental frequency

(F0)

Wednesday, February 11, 2009

Image removed due to copyright restrictions.Graph of frequency separation between partials vs. frequency of the partial.From Plomp, R. Aspects of Tone Sensation. New York, NY: Academic Press, 1976.

Page 85: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Resolution of harmonics (based on psychophysics)

Wednesday, February 11, 2009

Figure by MIT OpenCourseWare.

Page 86: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Goldsteinʼs harmonic templates

Figure removed due to copyright restrictions. Diagram of periodicity pitch as harmonic frequency pattern recognition. Figure 3 in Goldstein, J. L., et al."Verification of the Optimal Probabilistic Basis of Aural Processing in Pitch of Complex Tones." J Acoust Soc Am 63 (1978): 486-510. http://dx.doi.org/10.1121/1.381749

Wednesday, February 11, 2009

Page 87: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

A "two-mechanism" perspective (popular with some psychophysicists)

n= 5-10

30

unresolved harmonics weak temporal mechanism

phase-dependent; first-order intervals

harm

onic

num

ber

place-based representation

level-independentfine

resolved harmonics strong spectral pattern mechanism

phase-independentrate-place? interval-place?

Dominance regionf, F0

place-based representationslevel-dependent

coarse

100 1k 10k

Wednesday, February 11, 2009

Page 88: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Goldstein JL (1970) Aural combination tones. In: Frequency Analysis and PeriodicityDetection in Hearing (Plomp R, Smoorenburg GF, eds), pp 230-247. Leiden: A.W. Sijthoff.

Goldstein JL (1973) An optimum processor theory for the central formation of the pitchof complex tones. J Acoust Soc Am 54:1496-1516.

Julius Goldstein Goldstein JL, Kiang NYS (1968) Neural correlates of the aural combination tone 2f1-f2. IEEE Proc 56:981-992.

references Goldstein JL, Srulovicz P (1977) Auditory-nerve spike intervals as an adequate basis foraural frequency measurement. In: Psychophysics and Physiology of Hearing(Evans EF, Wilson JP, eds). London: Academic Press.

Goldstein JL, Buchsbaum G, First M (1978a) Compatibility between psychophysical andphysiological measurements of aural combination tones. J Åcoust Soc AmModels for 63:474-485.

Goldstein JL, Buchsbaum G, Furst M (1978b) Compatibility between psychophysical andpure tone physiological measurements of aural combination tones... Journal of theAcoustical Society of America 63:474-485.pitch Goldstein JL, Gerson A, Srulovicz P, Furst M (1978c) Verification of the optimal

discrimination, probabilistic basis of aural processing in pitch of complex tones. J Acoust Soc Am63:486-510.

low pitches of H. L. Duifhuis and L. F. Willems and R. J. Sluyter ( 1982,) Measurement of pitch inspeech: An implementation of Goldstein's theory of pitch perception,. jasa, 71,:1568--1580.complex tones, Houtsma AJM, Goldstein JL (1971) Perception of musical intervals: Evidence for thecentral origin of the pitch of complex tones. In: M.I.T./R.L.E.binaural pitches, Houtsma AJM, Goldstein JL (1972) The central origin of the pitch of complex tones:Evidence from musical interval recognition. J Acoust Soc Am 51:520-529.and P. Srulovicz and J. Goldstein ( 1983) A central spectrum model: A synthesis ofauditory nerve timing and place cues in monoaural communication offrequencyaural distortion spectrum,. jasa, 73,: 1266--1276,.

products Srulovicz P, Goldstein JL (1977) Central spectral patterns in aural signal analysis basedon cochlear neural timing and frequency filtering. In: IEEE, p 4 pages. Tel Aviv,Israel.

Srulovicz P, Goldstein JL (1983) A central spectrum model: a synthesis of auditory-nervetiming and place cues in monaural communication of frequency spectrum. JAcoust Soc Am 73:1266-1276.

Wednesday, February 11, 2009

Page 89: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Terhard's method of common subharmonics

Spectral vs. virtual pitch: duplex modelVirtual pitch computation:1. Identify frequency component2. Find common subharmonics 3. Strongest common subharmonic after F0 weighting is the virtual pitchTerhardt's model has been extended by Parncutt to cover pitch multiplicity and fundamental bass of chords

Wednesday, February 11, 2009

Page 90: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Terhardt references

Terhardt E (1970) Frequency analysis and periodicity detection in the sensations ofroughness and periodicity pitch. In: Frequency Analysis and Periodicity Detectionin Hearing (Plomp R, Smoorenburg GF, eds). Leiden: A. W. Sijthoff.

Terhardt E (1974a) On the perception of periodic sound fluctuations (roughness).Acustica 30:201-213.

Terhardt E (1974b) Pitch, consonance, and harmony. J Acoust Soc Am 55:1061-1069.Terhardt E (1977) The two-component theory of musical consonance. In: Psychophysics

and Physiology of Hearing (Evans EF, Wilson JP, eds), pp 381-390. London:Academic Press.

Terhardt E (1979) Calculating virtual pitch. Hearing Research 1:155-182.Terhardt E (1984) The concept of musical consonance: a link between music and

psychoacoustics. Music Perception 1:276-295.Terhardt E, Stoll G, Seewann M (1982a) Pitch of complex signals according to virtual-

pitch theory: test, examples, and predictions. J Acoust Soc Am 71:671-678.Terhardt E, Stoll G, Seewann M (1982b) Algorithm for extraction of pitch and pitch

salience from complex tonal signals. J Acoust Soc Am 71:679-688.

Wednesday, February 11, 2009

Page 91: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

SPINET: Cohen Grossberg, Wyse JASA

Fixed neuralnetwork:connection weightsarrangedso as to formpitch-equivalenceclasses

Wednesday, February 11, 2009

Courtesy of Prof. Stephen Grossberg. Used with permission.Source: Cohen, M. A., S. Grossberg, and L. L. Wyse."A Spectral Network Model of Pitch Perception."Technical Report CAS/CNS TR-92-024, Boston University. Also published in J Acoust Soc Am98, no. 2 part 1 (1995): 862-79.

Page 92: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Broad tuning and rate saturationat moderate levels in low-CF auditory nerve fibers confoundsrate-based resolution of harmonics.

Low SR auditory nerve fiber

Rose, 1971 Wednesday, February 11, 2009

Figure by MIT OpenCourseWare.

Page 93: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Spectral pattern theories - pros & cons

Do make use of frequency tuning properties ofelements in the auditory system

No clear neural evidence of narrow (< 1/3 octave)frequency channels in low-BF regions (< 2 kHz)

Operate on perceptually-resolved harmonicsDo not explain low pitches of unresolved harmonics

Require templates or harmonic pattern analyzersLittle or no neural evidence for required analyzersProblems w. templates: relative nature of pitch

Do not explain well existence region for F0Learning theories don't account for F0 ranges or for phylogenetic ubiquity of periodicity pitch

Wednesday, February 11, 2009

Page 94: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Problems with rate-place models

In contrast to musical (F0) pitch percepts....

Rate-place spectral profiles • have coarse resolution (≥ 1 octave) • change with sound level • worsen dramatically at higher sound levels • should work better for high frequencies • cannot account for F0 pitches of unresolved harmonics

Wednesday, February 11, 2009

Page 95: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Char

acte

ristic

freq

. (kH

z)

Some Local Central spectrum

possible Rate-place

auditoryMasking phenomena

representations Loudness CF

10

Synchrony-place Central spectrum

1

Interval-place 0 10 20 30 40 50 CF

Peristimulus time (ms) Pure tone pitch JNDs

Population interval Stages of 1/F0

integration Population-interval All-at-once Complex tone pitch

Global Interval

Wednesday, February 11, 2009

Page 96: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

.

160

Single-formant vowel Single-formant vowel

Cochlear nucleus II Variable F0 Variable F0 Spectra and waveform for F0=80 Spectra and waveform for F0=160

50 50 dB

dB 0

00 Frequency (kHz) 3 0 Time(ms) 25 0 Frequency (kHz) 3 255 Time(ms) 280

F1 = 640 Hz F1 = 640 Hz Pitch ~ 1/F0F0 = 80-160 Hz F0 = 160-320 Hz

80 160 32080 160Pooled 15 15

ANF's 1/F0

10 1/F0pitch

pitch 10

Inte

rval

(ms)

5 5

0 100 200 300 400 500 0 100 200 300 400 500

Unit 45-18-9 CF: 400 Hz Unit 45-13-6 CF: 750 Hz

30 AVCN Pri Thr: 21.4 dB SPL 30 AVCN Pri Thr: 23.6 dB SPL

PriIn

terv

al (m

s)

10

20

0 100 200 300 400 500 Unit 16-5-56 Chop-S F1 = 600 HzF0 = 100-125 Hz

1/F0pitch10

0 100 200 300 400 500

1/F0 20 pitch

ChopS

PauseCF: 4418 Hz Thr: -17.6 dB SPL Unit 45-15-15 CF: 4418 Hz Unit 45-15-16

100 100125CF: 1514 Hz Thr: 10.6 dB SPL

Unit 45-12-6 Chop-S 80 dB SPL CF: 3510 Hz

Thr: 6.9 dB SR: 26 s/s

Inte

rval

(ms)

1/F0pitch

10

20

30

0 500100 200 300 400

10

20

30

0 500100 200 300 400

1/F0pitch

Thr: -17.6 dB SPL Pause Pause SR: 39 s/s SR: 39 s/s 30

1/F0 10 pitch

2/F0 30

20

10

0 100 200 300 400 500 Peristimulus time (ms) Peristimulus time (ms)

Inte

rval

(ms)

0 100 200 300 400 500

20 1/F0pitch

Wednesday, February 11, 2009

Page 97: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Modulation detectors in the midbrain

Stimulus-related temporal discharge patterns Modulation-tuningof discharge rate

See: Günter Ehret (1997) The auditory midbrain, a “shunting yard” of acoustical information processing. In The Central Auditory System, Ehret, G. & Romand, R., eds. Oxford University Press. Langner, G. and Schreiner, C.E. Periodicity coding in the inferior colliculus of the cat. I. Neuronal mechanisms. J. Neurophysiol. 60:1799-1822. Langner (1992) review, Periodicity coding in the auditory system. Hearing Research, 60:115-142.

Wednesday, February 11, 2009

Images removed due to copyright restrictions.See Fig. 2 and 3 in Langner, G. and C. E. Schreiner."Periodicity Coding in the Inferior Colliculus of the Cat. I. Neuronal Mechanisms."J Neurophysiol 60 (1988): 1799-1822.

Page 98: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Pitch-related temporalpatterns in field potentialsin awake monkey cortex

Figure. Averaged cor -tical field potentials(current source densi -ty analysis, lower lami-na 3, site BF=5 kHz)in response to 50 msclick trains F0=100-500 Hz. Ripples up to300-400 Hz show syn-chronized componentof the ensemble-response, FromSteinschneider (1999).

Wednesday, February 11, 2009

Image removed due to copyright restrictions.See Fig. 9 right, in Steinschneider, M., et al."Click Train Encoding in Primary AuditoryCortex of the Awake Monkey: Evidence for TwoMechanisms Subserving Pitch Perception."J Acoust Soc Am 104, no. 5 (1998): 2935-2955. DOI: 10.1121/1.423877.

Page 99: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Wednesday, February 11, 2009

Courtesy of Prof. Mark J. Tramo, M.D., Ph.D. Used with permission.Source: Tramo, Mark J. "Neural Representations of Acoustic Information in Relation to Voice Perception."Havard University PhD Thesis, 1999.

Page 100: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Pure tone temporal response profiles in auditory cortex (A1)

Wednesday, February 11, 2009

Courtesy of Prof. Mark J. Tramo, M.D., Ph.D. Used with permission. Source: Tramo, Mark J. "Neural Representations of Acoustic Information in Relation to Voice Perception." Havard University PhD Thesis, 1999.

Page 101: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Phase-locking in thalamus and cortex

De Ribaupierre: 10% of thalamic units in awake cats with synchronization indices of 0.3 or better to 1-2 kHz tones

Reports of isolated cortical units with phase-lockingto Fm of AM tones up to 1 kHz (Semple)

Pitch detectors in thalamus and cortex

Schwarz & Tomlinson failed to find true F0 detectors in their study of > 200 cortical units in awake macaque

Riquimaroux found 16 units that responded both to pure tonesand harmonic complexes that would evoke the same pitch

Wednesday, February 11, 2009

Page 102: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Bendor & Wang(2005) F0-tuned units in auditory cortex

Wednesday, February 11, 2009

Image removed due to copyright restrictions.See Fig. 1 in Bendor and Wang. "The Neuronal Representation of Pitch inPrimate Auditory Cortex." Nature 436 (2005): 1161-1165.

Page 103: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

The enigma of the central representation of pitch

The tight correspondences between psychophysics and the population interspike interval code strongly suggest that our perception of pitch depends on this information.

Yet, despite some recent advances, we still do not understand how the central auditory system uses this information.

What happens to neural timing information as one ascends the auditory pathway from auditory nerve to cortex?

Is time converted to some sort of place code (e.g. pitch detectors) or is some other kind of code involved?

Wednesday, February 11, 2009

Page 104: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Discharge rate as a function of frequency and intensity

Auditory nerve fiber Rose (1971)

Wednesday, February 11, 2009

Figure by MIT OpenCourseWare.

Page 105: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Frequency response curves for fibers in the cochlear nerve of the squirrel monkey. Left: low spontenous activity; right: high spontaneous activity.

From Rose, J. E., et al. “Some Effects of Stimulus Intensity on Response of Auditory Nerve Fibers in the Squirrel Monkey.” J Neurophysiol 34 (1971): 685–699.

Four graphs removed due to copyright restrictions.See Fig. 2 and Fig. 4 in Rose, J. E., et al. “Some Effects of StimulusIntensity on Response of Auditory Nerve Fibers in the Squirrel Monkey.”J Neurophysiol 34 (1971): 685–699.

Page 106: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

Alan PalmerIn Hearing, Moore ed.

Figure 4 in Palmer, Alan. "Neural Signal Processing." Chapter 3 in Hearing. 2nd ed. Edited by B. C. J. Moore. Academic Press, 1995. [Preview this image in Google Books]

Wednesday, February 11, 2009

Page 107: Neural Representation of Musical Pitch - aprende.org · 2017. 1. 20. · Subharmonics are integer divisions of the fundamental: e.g. for F0= 100 Hz, subharmonics are at 50, 33, 25,

MIT OpenCourseWarehttp://ocw.mit.edu

HST.725 Music Perception and Cognition Spring 2009

For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms.