Upload
others
View
1
Download
0
Embed Size (px)
Citation preview
overview intro perception music features summary
MUSI-6201 — Computational Music AnalysisPart 5.1: Intensity
alexander lerch
November 4, 2015
overview intro perception music features summary
instantaneous featuresoverview
text bookChapter 4: Intensity (pp. 71–78)
sources: slides (latex) & Matlab
github repository
lecture contentloudness perception and decibelsdynamics in musicinstantaneous features
overview intro perception music features summary
instantaneous featuresoverview
text bookChapter 4: Intensity (pp. 71–78)
sources: slides (latex) & Matlab
github repository
lecture contentloudness perception and decibelsdynamics in musicinstantaneous features
overview intro perception music features summary
instantaneous featuresoverview
text bookChapter 4: Intensity (pp. 71–78)
sources: slides (latex) & Matlab
github repository
lecture contentloudness perception and decibelsdynamics in musicinstantaneous features
overview intro perception music features summary
instantaneous featuresoverview
text bookChapter 4: Intensity (pp. 71–78)
sources: slides (latex) & Matlab
github repository
lecture contentloudness perception and decibelsdynamics in musicinstantaneous features
overview intro perception music features summary
intensity, magnitude & loudnessintroduction
intensity-related descriptors commonly usedwaveform view
t [s]
0 5 10
x(t
) (p
op
)
-1
-0.5
0
0.5
1
t [s]
0 5 10
x(t
) (s
trin
gq
ua
rte
t)
-1
-0.5
0
0.5
1
t [s]
0 5 10
x(t
) (s
pe
ech
)
-1
-0.5
0
0.5
1
level monitoring (PPM, VU,. . . )
terms and definitions
magnitudeintensityenvelopelevelvolumevelocityloudness
overview intro perception music features summary
intensity, magnitude & loudnessintroduction
intensity-related descriptors commonly usedwaveform view
t [s]
0 5 10
x(t
) (p
op
)
-1
-0.5
0
0.5
1
t [s]
0 5 10
x(t
) (s
trin
gq
ua
rte
t)
-1
-0.5
0
0.5
1
t [s]
0 5 10
x(t
) (s
pe
ech
)
-1
-0.5
0
0.5
1
level monitoring (PPM, VU,. . . )
terms and definitionsmagnitudeintensityenvelopelevelvolumevelocityloudness
overview intro perception music features summary
intensity, magnitude & loudnesshuman perception 1/2
perception has non-linear relation to intensity:
model: logarithmic relation
vdB(n) = 20 · log10
(v(n)
v0
)v0: reference constant (0 dB point)digital: v0 = 1 ⇒ dBFSscaling factor: 1 dB ≈ JNDL
overview intro perception music features summary
intensity, magnitude & loudnesshuman perception 1/2
perception has non-linear relation to intensity:
model: logarithmic relation
vdB(n) = 20 · log10
(v(n)
v0
)v0: reference constant (0 dB point)digital: v0 = 1 ⇒ dBFSscaling factor: 1 dB ≈ JNDL
overview intro perception music features summary
intensity, magnitude & loudnesshuman perception 1/2
perception has non-linear relation to intensity:
model: logarithmic relation
vdB(n) = 20 · log10
(v(n)
v0
)v0: reference constant (0 dB point)digital: v0 = 1 ⇒ dBFSscaling factor: 1 dB ≈ JNDL
overview intro perception music features summary
intensity, magnitude & loudnessexcursion: level computation
if v(n) = 0 ⇒: computation of log10(0)
work-aroundsa add constant ε
vdB(n) = 20 · log10(v(n) + ε)
b add if statement
vtrunc(n) =
{v(n), if v(n) ≥ εε, otherwise
overview intro perception music features summary
intensity, magnitude & loudnessexcursion: level computation
if v(n) = 0 ⇒: computation of log10(0)work-arounds
a add constant ε
vdB(n) = 20 · log10(v(n) + ε)
vdB [dB]
-40 -35 -30 -25 -20 -15 -10 -5 0
v′ dB−
vdB[dB]
0
5
10
15
20
ǫ = 1e − 01
ǫ = 1e − 02
ǫ = 1e − 03
ǫ = 1e − 04
b add if statement
vtrunc(n) =
{v(n), if v(n) ≥ εε, otherwise
ma
tla
bso
urc
e:m
atl
ab
/d
isp
layL
og
Ep
silo
n.m
overview intro perception music features summary
intensity, magnitude & loudnessexcursion: level computation
if v(n) = 0 ⇒: computation of log10(0)
work-aroundsa add constant ε
vdB(n) = 20 · log10(v(n) + ε)
b add if statement
vtrunc(n) =
{v(n), if v(n) ≥ εε, otherwise
overview intro perception music features summary
intensity, magnitude & loudnesshuman perception 2/2
decibel scale is not loudness scale:
equal-sized steps on the decibel scale not perceived asequal-sized loudness steps
perceptual loudness depends on
frequencycochlear resolutionmasking effects
overview intro perception music features summary
intensity, magnitude & loudnesshuman perception 2/2
decibel scale is not loudness scale:
equal-sized steps on the decibel scale not perceived asequal-sized loudness steps
perceptual loudness depends on
frequencycochlear resolutionmasking effects
overview intro perception music features summary
intensity, magnitude & loudnesshuman perception 2/2
f [Hz]
102 103 104
SPL
[dB]
0
20
40
60
80
100
120
0 phon
20 phon
40 phon
60 phon
80 phon
90 phon
ma
tla
bso
urc
e:m
atl
ab
/d
isp
layE
qu
alL
ou
dn
essC
on
tou
rs.m
overview intro perception music features summary
intensity, magnitude & loudnessdynamics in music
score:
only several rough dynamic steps,e.g.:pp, p, mf, f, ffcomparably vague instructions on volume modifications, e.g.:crescendo, decrescendo, sfdynamics influenced by
instrumentationtimbrenumber of voicescontext and musical tension
MIDI:128 velocity stepsno standardized relation to magnitude, power, . . .
overview intro perception music features summary
intensity, magnitude & loudnessdynamics in music
score:
only several rough dynamic steps,e.g.:pp, p, mf, f, ffcomparably vague instructions on volume modifications, e.g.:crescendo, decrescendo, sfdynamics influenced by
instrumentationtimbrenumber of voicescontext and musical tension
MIDI:128 velocity stepsno standardized relation to magnitude, power, . . .
overview intro perception music features summary
intensity, magnitude & loudnessdynamics in music
score:
only several rough dynamic steps,e.g.:pp, p, mf, f, ffcomparably vague instructions on volume modifications, e.g.:crescendo, decrescendo, sfdynamics influenced by
instrumentationtimbrenumber of voicescontext and musical tension
MIDI:128 velocity stepsno standardized relation to magnitude, power, . . .
overview intro perception music features summary
intensity, magnitude & loudnessdynamics in music
score:
only several rough dynamic steps,e.g.:pp, p, mf, f, ffcomparably vague instructions on volume modifications, e.g.:crescendo, decrescendo, sfdynamics influenced by
instrumentationtimbrenumber of voicescontext and musical tension
MIDI:128 velocity stepsno standardized relation to magnitude, power, . . .
overview intro perception music features summary
intensity, magnitude & loudnessfeatures: root mean square 1/2
vRMS(n) =
√√√√√ 1
K
ie(n)∑i=is(n)
x(i)2
overview intro perception music features summary
intensity, magnitude & loudnessfeatures: root mean square 1/2
vRMS(n) =
√√√√√ 1
K
ie(n)∑i=is(n)
x(i)2
value of this feature for the hypothetical prototype signalssilencesinusoidal (Amplitude A)
overview intro perception music features summary
intensity, magnitude & loudnessfeatures: root mean square 1/2
vRMS(n) =
√√√√√ 1
K
ie(n)∑i=is(n)
x(i)2
5 10 15 20 25
f [k
Hz]
0
5
10
t [s]
0 5 10 15 20 25
x(i)
-1
0
1
v(n
)
-50
ma
tla
bso
urc
e:m
atl
ab
/d
isp
layF
eatu
res.
m
overview intro perception music features summary
intensity, magnitude & loudnessfeatures: root mean square 2/2
common variants (sample processing only):
reduce computational complexity
v 2RMS(n) =
x(ie(n))2 − x(is(n − 1))2
ie(n)− is(n) + 1+ v 2
RMS(n − 1)
vRMS(n) =√
v 2RMS(n)
single pole approximation
vtmp(i) = α · vtmp(i − 1) + (1− α) · x(i)2
v∗RMS(i) =
√vtmp(i)
overview intro perception music features summary
intensity, magnitude & loudnessfeatures: root mean square 2/2
common variants (sample processing only):
reduce computational complexity
v 2RMS(n) =
x(ie(n))2 − x(is(n − 1))2
ie(n)− is(n) + 1+ v 2
RMS(n − 1)
vRMS(n) =√
v 2RMS(n)
single pole approximation
vtmp(i) = α · vtmp(i − 1) + (1− α) · x(i)2
v∗RMS(i) =
√vtmp(i)
overview intro perception music features summary
intensity, magnitude & loudnessfeatures: weighted root mean square
x(i)H(z) RMS
v(n)
H(z):
A, B, C weighting
RLB (BS.1770)
. . .
overview intro perception music features summary
intensity, magnitude & loudnessfeatures: weighted root mean square
x(i)H(z) RMS
v(n)
H(z):
A, B, C weighting
RLB (BS.1770)
. . .
overview intro perception music features summary
intensity, magnitude & loudnessfeatures: weighted root mean square
x(i)H(z) RMS
v(n)
H(z):
A, B, C weightingRLB (BS.1770). . .
f [Hz]
102 103 104
|H(f
)|[dB]
-20
-10
0
10
BS.1770 MC
ITU-R BS.468
A Weighting
C Weighting
Z Weighting ma
tla
bso
urc
e:m
atl
ab
/d
isp
layL
ou
dn
essW
eig
hti
ng
.m
overview intro perception music features summary
intensity, magnitude & loudnessfeatures: peak envelope (max)
vPeak(n) = maxis(n)≤i≤ie(n)
|x(i)|
overview intro perception music features summary
intensity, magnitude & loudnessfeatures: peak envelope (max)
vPeak(n) = maxis(n)≤i≤ie(n)
|x(i)|
5 10 15 20 25
f [k
Hz]
0
5
10
t [s]
0 5 10 15 20 25
x(i)
-1
0
1
v(n
)
-50
ma
tla
bso
urc
e:m
atl
ab
/d
isp
layF
eatu
res.
m
overview intro perception music features summary
intensity, magnitude & loudnessfeatures: peak envelope (PPM) 1/2
⊕ ⊗ ⊕ ⊕
z−1
⊗
x(i) |x(i)|
−
αAT
λ
−
vPPM(i)
overview intro perception music features summary
intensity, magnitude & loudnessfeatures: peak envelope (PPM) 1/2
⊕ ⊗ ⊕ ⊕
z−1
⊗
x(i) |x(i)|
−
αAT
λ
−
vPPM(i)
release state (|x(i)| < vPPM(i − 1)⇒ λ = αRT)
vPPM(i) = vPPM(i − 1)− αRT · vPPM(i − 1)
= (1− αRT) · vPPM(i − 1)
overview intro perception music features summary
intensity, magnitude & loudnessfeatures: peak envelope (PPM) 1/2
⊕ ⊗ ⊕ ⊕
z−1
⊗
x(i) |x(i)|
−
αAT
λ
−
vPPM(i)
release state (|x(i)| < vPPM(i − 1)⇒ λ = αRT)
vPPM(i) = vPPM(i − 1)− αRT · vPPM(i − 1)
= (1− αRT) · vPPM(i − 1)
overview intro perception music features summary
intensity, magnitude & loudnessfeatures: peak envelope (PPM) 1/2
⊕ ⊗ ⊕ ⊕
z−1
⊗
x(i) |x(i)|
−
αAT
λ
−
vPPM(i)
attack state (|x(i)| ≥ vPPM(i − 1)⇒ λ = 0)
vPPM(i) = αAT ·(|x(i)| − vPPM(i − 1)
)+ vPPM(i − 1)
= αAT · |x(i)|+ (1− αAT) · vPPM(i − 1)
overview intro perception music features summary
intensity, magnitude & loudnessfeatures: peak envelope (PPM) 1/2
⊕ ⊗ ⊕ ⊕
z−1
⊗
x(i) |x(i)|
−
αAT
λ
−
vPPM(i)
attack state (|x(i)| ≥ vPPM(i − 1)⇒ λ = 0)
vPPM(i) = αAT ·(|x(i)| − vPPM(i − 1)
)+ vPPM(i − 1)
= αAT · |x(i)|+ (1− αAT) · vPPM(i − 1)
overview intro perception music features summary
intensity, magnitude & loudnessfeatures: peak envelope (PPM) 2/2
5 10 15 20 25
f [k
Hz]
0
5
10
t [s]
0 5 10 15 20 25
x(i)
-1
0
1
v(n
)
-50
discuss differences between peak meter and maxper block
ma
tla
bso
urc
e:m
atl
ab
/d
isp
layF
eatu
res.
m
overview intro perception music features summary
intensity, magnitude & loudnessfeatures: peak envelope (PPM) 2/2
5 10 15 20 25
f [k
Hz]
0
5
10
t [s]
0 5 10 15 20 25
x(i)
-1
0
1
v(n
)
-50
discuss differences between peak meter and maxper block
ma
tla
bso
urc
e:m
atl
ab
/d
isp
layF
eatu
res.
m
overview intro perception music features summary
intensity, magnitude & loudnessfeatures: peak envelope (PPM) 2/2
5 10 15 20 25
f [k
Hz]
0
5
10
t [s]
0 5 10 15 20 25
x(i)
-1
0
1
v(n
)
-50
discuss differences between peak meter and maxper block
ma
tla
bso
urc
e:m
atl
ab
/d
isp
layF
eatu
res.
m
overview intro perception music features summary
intensity, magnitude & loudnessfeatures: zwicker loudness
Stimulus Outer EarTransferFunction
ExcitationPatterns
SpecificLoudness
OverallLoudness
vLoud
overview intro perception music features summary
intensity, magnitude & loudnessfeatures: zwicker loudness
Stimulus Outer EarTransferFunction
ExcitationPatterns
SpecificLoudness
OverallLoudness
vLoud
outer ear transfer function1
1D. Hammershøi and H. Møller, “Methods for Binaural Recording and Reproduction,” Acta Acustica united
with Acustica, vol. 88, no. 3, pp. 303–311, May 2002.
overview intro perception music features summary
intensity, magnitude & loudnessfeatures: zwicker loudness
Stimulus Outer EarTransferFunction
ExcitationPatterns
SpecificLoudness
OverallLoudness
vLoud
excitation patterns1
1M. Schleske, Vibrato of the musician, [Online]. Available:
http://www.schleske.de/en/our-research/handbook-violinacoustics/vibrato-of-the-musician.html
(visited on 07/29/2015).
overview intro perception music features summary
intensity, magnitude & loudnessfeatures: zwicker loudness
Stimulus Outer EarTransferFunction
ExcitationPatterns
SpecificLoudness
OverallLoudness
vLoud
specific loudness1
1U. of Salford, Customised metrics, [Online]. Available: https://www.salford.ac.uk/computing-science-
engineering/research/acoustics/psychoacoustics/sound-quality-making-products-sound-
better/sound-quality-testing/customised-metrics (visited on 07/29/2015).
overview intro perception music features summary
intensity, magnitude & loudnessfeatures: zwicker loudness
Stimulus Outer EarTransferFunction
ExcitationPatterns
SpecificLoudness
OverallLoudness
vLoud
overall loudnessvloud =
∑∀i
zi
overview intro perception music features summary
intensity, magnitude & loudnessderived features
number or ratio of pauses
dynamic range
statistical features from (RMS) histogram
. . .
overview intro perception music features summary
summarylecture content
1 why are intensity-related features often in dB
2 how does the dB-Scale relate to loudness
3 what are typical intensity-related features
overview intro perception music features summary
summarylecture content
1 why are intensity-related features often in dB
2 how does the dB-Scale relate to loudness
3 what are typical intensity-related features
overview intro perception music features summary
summarylecture content
1 why are intensity-related features often in dB
2 how does the dB-Scale relate to loudness
3 what are typical intensity-related features