Upload
edgar-potter
View
217
Download
1
Embed Size (px)
Citation preview
Department of Electrical & Computer Engineering
NoteFor this lecture many of the slides will be
accompanied by scanned pictures shown on the OHP from Zwicker and Fastl’s
“Psycho-acoustics facts and models” 2nd edition
Department of Electrical & Computer Engineering
Main Outline
• Anatomy of the Ear and Hearing DONE
• Auditory perception• Hearing aids and Cochlear implants.
Extra: Direction of Arrival Estimation
Department of Electrical & Computer Engineering
Auditory perception
• Shepard Tones
• Masking <Detailed look>
• Ohms Acoustic Law
• Critical Bands
• Webers law
• Just Noticeable Frequency
Department of Electrical & Computer Engineering
Roger Penrose M.C. Escher
Ascending and Descending
Optical Illusion
Audio
Illusion
Department of Electrical & Computer Engineering
Shepard Tones• Circularity in Judgments of Relative Pitch, Roger N. Shepard,
JASA 1964.– Sensitivity to descending pitch – Sensitivity to volume changes between these
pitches.• A set of eight tones all an octave apart • The tones simultaneously descend in pitch till half of
their original pitch.• Jump back up to their original pitch and repeat the cycle.• Perceive this change?• Unique volume curve• Effect: Seamless transition in the cycle.• It’s all in your head!
• Omit two of the eight tones in the mid frequency range.
Department of Electrical & Computer Engineering
You know I can't hear you when the water is running!
MASKING
Department of Electrical & Computer Engineering
Masking
• Low-frequency, broad banded sounds (like water running) will mask higher frequency sounds which are softer at the listener's ear (a conversational tone from across the room).– Example 2: Truck in street
• Masking occurs because two frequencies lie within a critical band and the higher amplitude one masks the lower amplitude signal.
• Masking can be because of broad band, narrowband noise, pure and complex tones.
• Masking threshold– Amount of dB for test tone to be just audible in presence of noise
See OHP Figure
Department of Electrical & Computer Engineering
Masking by Broad band noise
• White noise- frequency independent PSD• Masked thresholds are a function of frequency.• Low and very high frequency almost same as TOQ.• Above 500Hz, thresholds increase with increase in frequency• Increasing white noise by 10dB increases masked threshold up by
10dB for frequencies >500Hz.• =>Linear behavior of masking• NOTE: TOQ’s frequency dependence almost completely
disappears Ear’s frequency selectivity and critical bands.
See OHP Figure
Department of Electrical & Computer Engineering
Masking by Narrow band noise
• Narrow band <=Critical BW• Noise (constant Amplitude, Different Frequency)
– 0.25,1,4KHz – BW: 100, 160, 700Hz– 60dB
• Frequency dependence of threshold masked by 250Hz seems to be broader
• Maximum value of masked threshold is lower for higher frequencies.• Steep increase but flatter decrease
See OHP Figure
Department of Electrical & Computer Engineering
Masking by Narrow band noise (cont)
• Noise (Varying Amplitude, Fixed Frequency)– 1KHz noise– 20-100dB
• Slope of rise seems independent of Amplitude• But slope of fall is dependent on amplitude• Non-Linear frequency dependence• Strange effect at high masker amplitudes:
– At high amplitudes ear begins to listen to anything audible!!– Begin to hear difference noise (noise and testing tone)
See OHP Figure
Department of Electrical & Computer Engineering
Masking by Pure and Complex tones• Pure tones:
– Below threshold of Quiet of test tone can hear only masking tone– Above it <700hZ can hear both– From 900-10kHz can hear only masking tone though above threshold of
hearing for test tone.– Between 1-2kHz difference tones are also audible– Low level masker wider at low frequencies– High level maskers wider at high frequencies
• Complex tones:– Log scale distance between the partials has a larger difference at LF,
less difference at HF– Dips correspondingly become smaller as frequency increases– 2 octaves above highest spectral content curve approaches TOQ
See OHP Figure
Department of Electrical & Computer Engineering
Temporal Aspects of Masking
• Previously assume long lasting test and masking sounds• Speech has a strong temporal structure• Vowels --loudest parts• Consonants faint• Often plosive consonants are masked by preceding loud vowel
Department of Electrical & Computer Engineering
Temporal Aspects of Masking (cont)
• Simultaneous Masking• Pre-Stimulus/Backward/Premasking
– 1st test tone 2nd Masker
• Poststimulus/Forward/Postmasking– 1st Masker 2nd test tone
Department of Electrical & Computer Engineering
Types of MaskingSimultaneous masking
– Duration less than 200ms test tone threshold increases with decrease in duration.
– Duration >200ms constant test tone threshold– Assume hearing system integrates over a period of 200ms
Postmasking (100ms)– Decay in effect of masker 100ms– More dominant
Premasking (20ms)– Takes place before masker is on!!– Each sensation is not instantaneous , requires build-up time
• Quick build up for loud maskers• Slower build up for softer maskers
– Less dominant effect
See OHP Figure
Department of Electrical & Computer Engineering
Ohm’s Acoustic LawThe sound quality of a complex tone depends ONLY on the
amplitudes and NOT relative phases of its harmonics.
Department of Electrical & Computer Engineering
Critical Bands• Proposed by Fletcher• Noise which masks a test tone is the part of its spectrum which lies
near the tone
• Masking is achieved when the power of the tone and the power of the noise spectrum lying near the tone and masking it are the same.
• Bands defined this way have a BW which produces same acoustic power in the tone and in the noise in the band when the tone is masked. CRITICAL BANDS
See OHP Figure
Department of Electrical & Computer Engineering
Critical Band (cont.)
• How to measure?– Masking of a band pass noise using 2 tones
• CB corresponds with1.5mm spacing on BM.• 24 such band pass filters• BW of the filters increases with increasing center frequency• Logarithmic relationship Weber’s law example.• Bark scale
See OHP Figure
Department of Electrical & Computer Engineering
Webers law
• Weber's Law states that the ratio of the increment threshold to the background intensity is a constant.
• So when you are in a noisy environment you must shout to be heard while a whisper works in a quiet room.
• when you measure increment thresholds on various intensity backgrounds, the thresholds increase in proportion to the background.
Department of Electrical & Computer Engineering
Just noticeable change in Frequency• (Pg:183)
• Similar to variation in the critical band structure• This is because it depends on number of BPFs• More BPF better resolution• Till about 500Hz JND is about 3.6Hz.• After 500Hz it varies as 0.007f
See OHP Figure
Department of Electrical & Computer Engineering
Outline
• Facts on hearing loss
• Cell phones and hearing loss
• Types of Hearing aid
• Inside a hearing aid
• Audiogram
Department of Electrical & Computer Engineering
Facts on Hearing Loss in Adults
• One in every ten (28 million) Americans has hearing loss.
• The vast majority of Americans (95% or 26 million) with hearing loss can have their hearing loss treated with hearing aids.
• Only 5% of hearing loss in adults can be improved through medical or surgical treatment
• Millions of Americans with hearing loss could benefit from hearing aids but avoid them because of the stigma.
Department of Electrical & Computer Engineering
Cell phones and Hearing aids
• Cell Phones emit a type of electromagnetic energy that interferes with the operation of hearing aids.
• The Federal Communications Commission in mid-July 2003 ordered the cell phone industry to help out the hard-of-hearing.
“Within two years, cell-phone manufacturers must offer at least two phones with reduced interference for each type of cellular technology used, or ensure that one-fourth of phones the carriers sell produce less interference.”
• The FCC’s final milestone is February 2008.
Department of Electrical & Computer Engineering
Types of Hearing aids
Behind The earIn the Ear
In the Canal Completely in the canal
Department of Electrical & Computer Engineering
Anatomy of a Hearing Aid
• Microphone• Tone hook• Volume control• On/off switch
• Battery compartment
Department of Electrical & Computer Engineering
Inside a Hearing aid
1: The microphone The microphone picks up sound waves from the air and transforms them into electrical signals.
2: The microphone suspension The microphone suspension holds the microphone in place.
3: The loudspeaker The loudspeaker sends the amplified sounds into your ear. The loudspeaker is also called the receiver and sometimes the telephone.
4: The battery drawer The battery drawer holds the battery in place.
5: The amplifier The amplifier makes the signals that come from the microphone louder.
6: The telecoil The telecoil makes it possible for you to hear one specific person if you are in a place that supports the use of a telecoil. Many classrooms, churches and cinemas have telecoil. The telecoil makes it possible for you to hear i.e. your teacher without hearing the noise around you. It is also possible to use the telecoil at home - with the TV or the radio.
Department of Electrical & Computer Engineering
Direction of Arrival (DOA) estimation algorithm
<Useful for class project ideas>
Department of Electrical & Computer Engineering
Talk outline
• Necessity for DOA• DOA algorithm Requirements• Types of DOA algorithms
– Delay and sum– Minimum variance– MUSIC– Coherent MUSIC– Root MUSIC– ESPRIT
• Comparison Measures• Computational Intensity comparison• Accuracy Comparison• Accuracy vs Computational intensity• Conclusion
Department of Electrical & Computer Engineering
Where does the DOA come into the picture?
Has 2 microphones
DOA Estimation sn
Beamformer
Lets meet at 11?!?
7 is good for me too!!
11 sounds good!!
Department of Electrical & Computer Engineering
Direction of Arrival Estimation Algorithms
The DOA algorithm must satisfy the following conditions :
– Low computational intensity (MIPS/MFLOPS)
– High accuracy (RMSE)
– High speed – Easy implementation– Good performance at low SNRs– Works on a 2 microphone array system with 4cm
separation between them.
Department of Electrical & Computer Engineering
DOA Algorithms
Spatial Correlation methods
Subspace decompositionmethods
MUSIC Multiple Signal Estimation
ESPRITEstimation of Signal parameters
using rotational invariance
Delay and Sum Minimum Variance
Coherent MUSIC
Root MUSIC
Department of Electrical & Computer Engineering
DOA Method Equation for Implementation
Delay and Sum
Minimum Variance
MUSIC
Coherent MUSIC
Root MUSIC
ESPRIT
*( ) ( ) ( )P a Sa
*
1( )
( ) ( ) ( )P
a inv s a
*
* *
( ) ( )( )
( ) ( )N N
a aP
a E E a
'( )
' 'N N
a aP
a E E a
1
0sin . ( ) /( )K c angle z d
1
0sin arg( ) /( )
K Kc d
Department of Electrical & Computer Engineering
Comparison Measures
• To evaluate the computational intensity– MFLOPS comparison plot
• To evaluate the accuracy– Root Mean Square Error comparison plot
• To evaluate the effect at low SNRs– SNR vs Estimated angle plot
• To evaluate overall performance– Accuracy vs computational intensity
Department of Electrical & Computer Engineering
Evaluation of computational Intensity:MFLOPS comparison chart Min Variance
0.93 Mflops
Coherent MUSIC
0.3958 Mflops
DS
0.3573 Mflops
MUSIC
0.0813 Mflops
ESPRIT
0.0086 Mflops
Root MUSIC
0.0068 Mflops
Department of Electrical & Computer Engineering
Comparison of accuracy at different SNR values
0 5 10 15 20 250
20
40
60
80
100
120
140
160
180
Frame Number
Est
imat
ed D
OA
(de
gree
s)
Comparison of Estimated DOAs: SNR=10dB,Speech= 90' ,Noise=0'
ESPRITRMUSICCMUSICMUSICMVDS
0 5 10 150
20
40
60
80
100
120
140
160
180Estimated DOAS for only those regions which are speech
Frames Number
Est
imat
ed D
OA
(D
egre
es)
Department of Electrical & Computer Engineering
Conclusion
• Tradeoff between Accuracy and Computational intensity leads to the conclusion that ESPRIT is the Direction of arrival estimation algorithm best suited for our purpose
• MFLOPS value: 0.0086 • RMSE value:~3 (at 10dB)
Department of Electrical & Computer Engineering
0 5 10 15 20 250
20
40
60
80
100
120
140
160
180
Frame Number
Est
imat
ed D
OA
(de
gree
s)
Comparison of Estimated DOAs: SNR=10dB,Speech= 90' ,Noise=0'
ESPRITRMUSICCMUSICMUSICMVDS
Department of Electrical & Computer Engineering
0 5 10 150
20
40
60
80
100
120
140
160
180Estimated DOAS for only those regions which are speech
Frames Number
Estim
ate
d D
OA
(D
egre
es)