MUSI-6201 | Computational Music Analysis...2015/10/05  · overview intro perception music features...

Preview:

Citation preview

overview intro perception music features summary

MUSI-6201 — Computational Music AnalysisPart 5.1: Intensity

alexander lerch

November 4, 2015

overview intro perception music features summary

instantaneous featuresoverview

text bookChapter 4: Intensity (pp. 71–78)

sources: slides (latex) & Matlab

github repository

lecture contentloudness perception and decibelsdynamics in musicinstantaneous features

overview intro perception music features summary

instantaneous featuresoverview

text bookChapter 4: Intensity (pp. 71–78)

sources: slides (latex) & Matlab

github repository

lecture contentloudness perception and decibelsdynamics in musicinstantaneous features

overview intro perception music features summary

instantaneous featuresoverview

text bookChapter 4: Intensity (pp. 71–78)

sources: slides (latex) & Matlab

github repository

lecture contentloudness perception and decibelsdynamics in musicinstantaneous features

overview intro perception music features summary

instantaneous featuresoverview

text bookChapter 4: Intensity (pp. 71–78)

sources: slides (latex) & Matlab

github repository

lecture contentloudness perception and decibelsdynamics in musicinstantaneous features

overview intro perception music features summary

intensity, magnitude & loudnessintroduction

intensity-related descriptors commonly usedwaveform view

t [s]

0 5 10

x(t

) (p

op

)

-1

-0.5

0

0.5

1

t [s]

0 5 10

x(t

) (s

trin

gq

ua

rte

t)

-1

-0.5

0

0.5

1

t [s]

0 5 10

x(t

) (s

pe

ech

)

-1

-0.5

0

0.5

1

level monitoring (PPM, VU,. . . )

terms and definitions

magnitudeintensityenvelopelevelvolumevelocityloudness

overview intro perception music features summary

intensity, magnitude & loudnessintroduction

intensity-related descriptors commonly usedwaveform view

t [s]

0 5 10

x(t

) (p

op

)

-1

-0.5

0

0.5

1

t [s]

0 5 10

x(t

) (s

trin

gq

ua

rte

t)

-1

-0.5

0

0.5

1

t [s]

0 5 10

x(t

) (s

pe

ech

)

-1

-0.5

0

0.5

1

level monitoring (PPM, VU,. . . )

terms and definitionsmagnitudeintensityenvelopelevelvolumevelocityloudness

overview intro perception music features summary

intensity, magnitude & loudnesshuman perception 1/2

perception has non-linear relation to intensity:

model: logarithmic relation

vdB(n) = 20 · log10

(v(n)

v0

)v0: reference constant (0 dB point)digital: v0 = 1 ⇒ dBFSscaling factor: 1 dB ≈ JNDL

overview intro perception music features summary

intensity, magnitude & loudnesshuman perception 1/2

perception has non-linear relation to intensity:

model: logarithmic relation

vdB(n) = 20 · log10

(v(n)

v0

)v0: reference constant (0 dB point)digital: v0 = 1 ⇒ dBFSscaling factor: 1 dB ≈ JNDL

overview intro perception music features summary

intensity, magnitude & loudnesshuman perception 1/2

perception has non-linear relation to intensity:

model: logarithmic relation

vdB(n) = 20 · log10

(v(n)

v0

)v0: reference constant (0 dB point)digital: v0 = 1 ⇒ dBFSscaling factor: 1 dB ≈ JNDL

overview intro perception music features summary

intensity, magnitude & loudnessexcursion: level computation

if v(n) = 0 ⇒: computation of log10(0)

work-aroundsa add constant ε

vdB(n) = 20 · log10(v(n) + ε)

b add if statement

vtrunc(n) =

{v(n), if v(n) ≥ εε, otherwise

overview intro perception music features summary

intensity, magnitude & loudnessexcursion: level computation

if v(n) = 0 ⇒: computation of log10(0)work-arounds

a add constant ε

vdB(n) = 20 · log10(v(n) + ε)

vdB [dB]

-40 -35 -30 -25 -20 -15 -10 -5 0

v′ dB−

vdB[dB]

0

5

10

15

20

ǫ = 1e − 01

ǫ = 1e − 02

ǫ = 1e − 03

ǫ = 1e − 04

b add if statement

vtrunc(n) =

{v(n), if v(n) ≥ εε, otherwise

ma

tla

bso

urc

e:m

atl

ab

/d

isp

layL

og

Ep

silo

n.m

overview intro perception music features summary

intensity, magnitude & loudnessexcursion: level computation

if v(n) = 0 ⇒: computation of log10(0)

work-aroundsa add constant ε

vdB(n) = 20 · log10(v(n) + ε)

b add if statement

vtrunc(n) =

{v(n), if v(n) ≥ εε, otherwise

overview intro perception music features summary

intensity, magnitude & loudnesshuman perception 2/2

decibel scale is not loudness scale:

equal-sized steps on the decibel scale not perceived asequal-sized loudness steps

perceptual loudness depends on

frequencycochlear resolutionmasking effects

overview intro perception music features summary

intensity, magnitude & loudnesshuman perception 2/2

decibel scale is not loudness scale:

equal-sized steps on the decibel scale not perceived asequal-sized loudness steps

perceptual loudness depends on

frequencycochlear resolutionmasking effects

overview intro perception music features summary

intensity, magnitude & loudnesshuman perception 2/2

f [Hz]

102 103 104

SPL

[dB]

0

20

40

60

80

100

120

0 phon

20 phon

40 phon

60 phon

80 phon

90 phon

ma

tla

bso

urc

e:m

atl

ab

/d

isp

layE

qu

alL

ou

dn

essC

on

tou

rs.m

overview intro perception music features summary

intensity, magnitude & loudnessdynamics in music

score:

only several rough dynamic steps,e.g.:pp, p, mf, f, ffcomparably vague instructions on volume modifications, e.g.:crescendo, decrescendo, sfdynamics influenced by

instrumentationtimbrenumber of voicescontext and musical tension

MIDI:128 velocity stepsno standardized relation to magnitude, power, . . .

overview intro perception music features summary

intensity, magnitude & loudnessdynamics in music

score:

only several rough dynamic steps,e.g.:pp, p, mf, f, ffcomparably vague instructions on volume modifications, e.g.:crescendo, decrescendo, sfdynamics influenced by

instrumentationtimbrenumber of voicescontext and musical tension

MIDI:128 velocity stepsno standardized relation to magnitude, power, . . .

overview intro perception music features summary

intensity, magnitude & loudnessdynamics in music

score:

only several rough dynamic steps,e.g.:pp, p, mf, f, ffcomparably vague instructions on volume modifications, e.g.:crescendo, decrescendo, sfdynamics influenced by

instrumentationtimbrenumber of voicescontext and musical tension

MIDI:128 velocity stepsno standardized relation to magnitude, power, . . .

overview intro perception music features summary

intensity, magnitude & loudnessdynamics in music

score:

only several rough dynamic steps,e.g.:pp, p, mf, f, ffcomparably vague instructions on volume modifications, e.g.:crescendo, decrescendo, sfdynamics influenced by

instrumentationtimbrenumber of voicescontext and musical tension

MIDI:128 velocity stepsno standardized relation to magnitude, power, . . .

overview intro perception music features summary

intensity, magnitude & loudnessfeatures: root mean square 1/2

vRMS(n) =

√√√√√ 1

K

ie(n)∑i=is(n)

x(i)2

overview intro perception music features summary

intensity, magnitude & loudnessfeatures: root mean square 1/2

vRMS(n) =

√√√√√ 1

K

ie(n)∑i=is(n)

x(i)2

value of this feature for the hypothetical prototype signalssilencesinusoidal (Amplitude A)

overview intro perception music features summary

intensity, magnitude & loudnessfeatures: root mean square 1/2

vRMS(n) =

√√√√√ 1

K

ie(n)∑i=is(n)

x(i)2

5 10 15 20 25

f [k

Hz]

0

5

10

t [s]

0 5 10 15 20 25

x(i)

-1

0

1

v(n

)

-50

ma

tla

bso

urc

e:m

atl

ab

/d

isp

layF

eatu

res.

m

overview intro perception music features summary

intensity, magnitude & loudnessfeatures: root mean square 2/2

common variants (sample processing only):

reduce computational complexity

v 2RMS(n) =

x(ie(n))2 − x(is(n − 1))2

ie(n)− is(n) + 1+ v 2

RMS(n − 1)

vRMS(n) =√

v 2RMS(n)

single pole approximation

vtmp(i) = α · vtmp(i − 1) + (1− α) · x(i)2

v∗RMS(i) =

√vtmp(i)

overview intro perception music features summary

intensity, magnitude & loudnessfeatures: root mean square 2/2

common variants (sample processing only):

reduce computational complexity

v 2RMS(n) =

x(ie(n))2 − x(is(n − 1))2

ie(n)− is(n) + 1+ v 2

RMS(n − 1)

vRMS(n) =√

v 2RMS(n)

single pole approximation

vtmp(i) = α · vtmp(i − 1) + (1− α) · x(i)2

v∗RMS(i) =

√vtmp(i)

overview intro perception music features summary

intensity, magnitude & loudnessfeatures: weighted root mean square

x(i)H(z) RMS

v(n)

H(z):

A, B, C weighting

RLB (BS.1770)

. . .

overview intro perception music features summary

intensity, magnitude & loudnessfeatures: weighted root mean square

x(i)H(z) RMS

v(n)

H(z):

A, B, C weighting

RLB (BS.1770)

. . .

overview intro perception music features summary

intensity, magnitude & loudnessfeatures: weighted root mean square

x(i)H(z) RMS

v(n)

H(z):

A, B, C weightingRLB (BS.1770). . .

f [Hz]

102 103 104

|H(f

)|[dB]

-20

-10

0

10

BS.1770 MC

ITU-R BS.468

A Weighting

C Weighting

Z Weighting ma

tla

bso

urc

e:m

atl

ab

/d

isp

layL

ou

dn

essW

eig

hti

ng

.m

overview intro perception music features summary

intensity, magnitude & loudnessfeatures: peak envelope (max)

vPeak(n) = maxis(n)≤i≤ie(n)

|x(i)|

overview intro perception music features summary

intensity, magnitude & loudnessfeatures: peak envelope (max)

vPeak(n) = maxis(n)≤i≤ie(n)

|x(i)|

5 10 15 20 25

f [k

Hz]

0

5

10

t [s]

0 5 10 15 20 25

x(i)

-1

0

1

v(n

)

-50

ma

tla

bso

urc

e:m

atl

ab

/d

isp

layF

eatu

res.

m

overview intro perception music features summary

intensity, magnitude & loudnessfeatures: peak envelope (PPM) 1/2

⊕ ⊗ ⊕ ⊕

z−1

x(i) |x(i)|

αAT

λ

vPPM(i)

overview intro perception music features summary

intensity, magnitude & loudnessfeatures: peak envelope (PPM) 1/2

⊕ ⊗ ⊕ ⊕

z−1

x(i) |x(i)|

αAT

λ

vPPM(i)

release state (|x(i)| < vPPM(i − 1)⇒ λ = αRT)

vPPM(i) = vPPM(i − 1)− αRT · vPPM(i − 1)

= (1− αRT) · vPPM(i − 1)

overview intro perception music features summary

intensity, magnitude & loudnessfeatures: peak envelope (PPM) 1/2

⊕ ⊗ ⊕ ⊕

z−1

x(i) |x(i)|

αAT

λ

vPPM(i)

release state (|x(i)| < vPPM(i − 1)⇒ λ = αRT)

vPPM(i) = vPPM(i − 1)− αRT · vPPM(i − 1)

= (1− αRT) · vPPM(i − 1)

overview intro perception music features summary

intensity, magnitude & loudnessfeatures: peak envelope (PPM) 1/2

⊕ ⊗ ⊕ ⊕

z−1

x(i) |x(i)|

αAT

λ

vPPM(i)

attack state (|x(i)| ≥ vPPM(i − 1)⇒ λ = 0)

vPPM(i) = αAT ·(|x(i)| − vPPM(i − 1)

)+ vPPM(i − 1)

= αAT · |x(i)|+ (1− αAT) · vPPM(i − 1)

overview intro perception music features summary

intensity, magnitude & loudnessfeatures: peak envelope (PPM) 1/2

⊕ ⊗ ⊕ ⊕

z−1

x(i) |x(i)|

αAT

λ

vPPM(i)

attack state (|x(i)| ≥ vPPM(i − 1)⇒ λ = 0)

vPPM(i) = αAT ·(|x(i)| − vPPM(i − 1)

)+ vPPM(i − 1)

= αAT · |x(i)|+ (1− αAT) · vPPM(i − 1)

overview intro perception music features summary

intensity, magnitude & loudnessfeatures: peak envelope (PPM) 2/2

5 10 15 20 25

f [k

Hz]

0

5

10

t [s]

0 5 10 15 20 25

x(i)

-1

0

1

v(n

)

-50

discuss differences between peak meter and maxper block

ma

tla

bso

urc

e:m

atl

ab

/d

isp

layF

eatu

res.

m

overview intro perception music features summary

intensity, magnitude & loudnessfeatures: peak envelope (PPM) 2/2

5 10 15 20 25

f [k

Hz]

0

5

10

t [s]

0 5 10 15 20 25

x(i)

-1

0

1

v(n

)

-50

discuss differences between peak meter and maxper block

ma

tla

bso

urc

e:m

atl

ab

/d

isp

layF

eatu

res.

m

overview intro perception music features summary

intensity, magnitude & loudnessfeatures: peak envelope (PPM) 2/2

5 10 15 20 25

f [k

Hz]

0

5

10

t [s]

0 5 10 15 20 25

x(i)

-1

0

1

v(n

)

-50

discuss differences between peak meter and maxper block

ma

tla

bso

urc

e:m

atl

ab

/d

isp

layF

eatu

res.

m

overview intro perception music features summary

intensity, magnitude & loudnessfeatures: zwicker loudness

Stimulus Outer EarTransferFunction

ExcitationPatterns

SpecificLoudness

OverallLoudness

vLoud

overview intro perception music features summary

intensity, magnitude & loudnessfeatures: zwicker loudness

Stimulus Outer EarTransferFunction

ExcitationPatterns

SpecificLoudness

OverallLoudness

vLoud

outer ear transfer function1

1D. Hammershøi and H. Møller, “Methods for Binaural Recording and Reproduction,” Acta Acustica united

with Acustica, vol. 88, no. 3, pp. 303–311, May 2002.

overview intro perception music features summary

intensity, magnitude & loudnessfeatures: zwicker loudness

Stimulus Outer EarTransferFunction

ExcitationPatterns

SpecificLoudness

OverallLoudness

vLoud

excitation patterns1

1M. Schleske, Vibrato of the musician, [Online]. Available:

http://www.schleske.de/en/our-research/handbook-violinacoustics/vibrato-of-the-musician.html

(visited on 07/29/2015).

overview intro perception music features summary

intensity, magnitude & loudnessfeatures: zwicker loudness

Stimulus Outer EarTransferFunction

ExcitationPatterns

SpecificLoudness

OverallLoudness

vLoud

specific loudness1

1U. of Salford, Customised metrics, [Online]. Available: https://www.salford.ac.uk/computing-science-

engineering/research/acoustics/psychoacoustics/sound-quality-making-products-sound-

better/sound-quality-testing/customised-metrics (visited on 07/29/2015).

overview intro perception music features summary

intensity, magnitude & loudnessfeatures: zwicker loudness

Stimulus Outer EarTransferFunction

ExcitationPatterns

SpecificLoudness

OverallLoudness

vLoud

overall loudnessvloud =

∑∀i

zi

overview intro perception music features summary

intensity, magnitude & loudnessderived features

number or ratio of pauses

dynamic range

statistical features from (RMS) histogram

. . .

overview intro perception music features summary

summarylecture content

1 why are intensity-related features often in dB

2 how does the dB-Scale relate to loudness

3 what are typical intensity-related features

overview intro perception music features summary

summarylecture content

1 why are intensity-related features often in dB

2 how does the dB-Scale relate to loudness

3 what are typical intensity-related features

overview intro perception music features summary

summarylecture content

1 why are intensity-related features often in dB

2 how does the dB-Scale relate to loudness

3 what are typical intensity-related features

Recommended