Upload
annabel-blake
View
218
Download
0
Tags:
Embed Size (px)
Citation preview
Blind Separation of Speech MixturesBlind Separation of Speech Mixtures
Vaninirappuputhenpurayil Gopalan REJU
School of Electrical and Electronic EngineeringNanyang Technological University
Vaninirappuputhenpurayil Gopalan REJU
School of Electrical and Electronic EngineeringNanyang Technological University
1021 PM 1
Introduction Introduction
Blind Source Separation Blind Source Separation
1021 PM
0
2120
1111 )()()()()(ll
ltslhltslhtx
0
2220
1212 )()()()()(ll
ltslhltslhtx
L
l
L
l
ltxlwltxlwty0
2120
1111 )()()()()(
L
l
L
l
ltxlwltxlwty0
2220
1212 )()()()()(
bull Mixing process
bull Unmixing process
ConvolutiveConvolutive
2
s1
s2
Introduction Introduction
Convolutive Blind Source Separation Convolutive Blind Source Separation Instantaneous Blind Source Separation Instantaneous Blind Source Separation
1021 PM 3
Introduction Introduction
Convolutive Blind Source Separation Convolutive Blind Source Separation Instantaneous Blind Source Separation Instantaneous Blind Source Separation
)(
)(
)(
)(
2
1
2221
1211
2
1
ts
ts
tx
tx
hh
hh
)()( tt SX H
)(
)(
)(
)(
2
1
2221
1211
2
1
ts
ts
hh
hh
tx
tx
)()( tt HSX
)(
)(
)()(
)()(
)(
)(
2
1
2221
1211
2
1
tfS
tfS
fHfH
fHfH
tfX
tfX
)()()( tfftf SX H
bull In frequency domain
Difficu
lt to
separa
te
Difficu
lt to
separa
te
Easy to
separa
te
Easy to
separa
te
1021 PM 4
Introduction Introduction
No of sources lt No of sensorNo of sources lt No of sensor1s
2s
3s
4s
1x2x3x
1s
2s1x2x3x
3s
1s
2s 1x2x3x
4s
No of sources = No of sensorNo of sources = No of sensor
No of sources gt No of sensorNo of sources gt No of sensor
Overdetermined mixingOverdetermined mixing
Determined mixingDetermined mixing
Underdetermined mixingUnderdetermined mixing
Difficult to se
parate
Difficult to se
parate
Easy to se
parate
Easy to se
parate
1021 PM 5
Approaches for BSS of Speech SignalsApproaches for BSS of Speech Signals
Types of mixing
Instantaneous mixing Convolutive mixing
1021 PM 6
Approaches for BSS of Speech SignalsApproaches for BSS of Speech Signals
Instantaneous mixing
Step 1 Selection of cost function
Step 2 Minimization or maximization of the cost function
1021 PM
WHS1
S2 X2
Y1
Y2
Separated
X1
7
Approaches for BSS of Speech SignalsApproaches for BSS of Speech SignalsInstantaneous mixingInstantaneous mixing
Selection of cost function
Statistical independence
Information theoretic
Non-Gaussianity
Kurtosis
Negentropy
Nonlinear cross moments
Temporal structure of speech
Non-stationarity of speech1021 PM
i
ii ypp y is idea Basic
Central limit theoremMixture of two or more sources will be more Gaussian than their individual components
224 E3E)( yyykurt
yyy HHJ gauss
yyyy dppH logEntropy
0 ji ygyfE
used becan statsticsorder Secondlags timedifferent for usly simultaneon correlatiooutput theedioganaliz eg
eddiagonalizusly simultaneo are matricesn correlatio the
and blocks into divided are Signals
Non Gaussianity measures
Signals from two different sources are independent
8
Approaches for BSS of Speech SignalsApproaches for BSS of Speech Signals
Instantaneous mixingMinimization or maximization of the cost function
simple gradient method
Natural gradient method
Newtonrsquos method
eg Informax ICA algorithm
eg FastICA
1021 PM 9
Approaches for BSS of Speech SignalsApproaches for BSS of Speech SignalsConvolutive Mixing
Time Domain Frequency Domain
QqltxlwyP
p
L
lpqpq 1)()(
1
1
0
)()()( tfftf SHX
)()()( tfftf XWY
AdvantageNo permutation problem
DisadvantageSlow convergence High computational cost for long filter taps
AdvantageLow computational costFast convergence
DisadvantagePermutation Problem
WHS1
S2
X1
X2
Y1 Y2
Y2 Y1
1021 PM 10
or
Permutation Problem in Frequency Domain BSS
Permutation Problem in Frequency Domain BSS
f1
f2
fk
x1x2
x3
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Still signals are mixed
K po
int
IFFT
Corresponding to different sources Due to permutation problem
One frequency binInstantaneous ICA algorithm
Solv
ing
perm
utati
on
Prob
lem y1
y2y3
Separated signals
Corresponding to y3
1021 PM 11
MotivationMotivation
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
12
My Contribution - IMy Contribution - I
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
13
Algorithm for Solving the Permutation Problem
Algorithm for Solving the Permutation Problem
f1
f2
fk
x1x2
x3
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Permutation problem
One frequency binInstantaneous ICA algorithm
Permutation problem solved
1021 PM 14
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Direction Of Arrival (DOA) method
2
1
)sin(2 1
p
dcfjkqpkq
pkefWfU Position of the pth sensor
Velocity of sound
)(
)(
)()(
)()(
)(
)(
2
1
2221
1211
2
1
k
k
kk
kk
k
k
fX
fX
fWfW
fWfW
fY
fY
1021 PM
Direction of y1 = -30o
Direction of y2 = 20o
15
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Reasons for failure at lower freq
Lower spacing causes error in phase difference measurement
The relation is approximated for plane wave front under anechoic condition
Disadvantages
Fails at lower frequenciesFails when sources are nearRoom reverberationSensor positions must be known
Direction Of Arrival (DOA) method
1021 PM 16
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
f1
f2
fk
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Low correlation
High correlationLow
correlation
x1x2
x3
Adjacent bands correlation method
1021 PM 17
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21r11 r12
r21 r22
s1
s2
Correlation matrix
21122211 rrrr No change
21122211 rrrr Change permutation
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM
21122211 and and rrrr
21122211 and and rrrr
With confidence Without confidence
2090
8010
8020
1090
2090
1040
1040
2090
Example Example
18
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21
r11 r12
r21 r22
s1
s2
Correlation matrix
Disadvantage The method is not robust
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM 19
1021 PM
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Combination of DOA and Correlation methods method
DOA + Harmonic Correlation + Adjacent bands correlation
Advantage Increased robustness
20
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
2y
1y
1x
2x
1s
2s
1021 PM 21
Time domain stage
Frequency domain stage
Partial separation method(Parallel configuration)
Partial separation method(Parallel configuration)
1021 PM 22
Time domain stage
Frequency domain stage
Parallel configuration
Partial separation method(Cascade configuration)
Partial separation method(Cascade configuration)
1021 PM 23
Time domain stage
Frequency domain stage
Advantages of Partial Separation methodAdvantages of Partial Separation method
bull Robustness
1021 PM 24
Comparison with Adjacent Bands Correlation Method
Comparison with Adjacent Bands Correlation Method
1021 PM 25
PS - Partial Separation method with confidence check C1 - Correlation between the adjacent bins without confidence check C2 - Correlation between adjacent bins with confidence check Ha - Correlation between the harmonic components with confidence check PS1 - Partial separation method alone without confidence check
1021 PM 26
Comparison with DOA methodComparison with DOA method
My Contribution -IIMy Contribution -II
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
27
Underdetermined Blind Source Separation of Instantaneous Mixtures
Underdetermined Blind Source Separation of Instantaneous Mixtures
t
k
0)(0)( 222221 tkStkS
0)(0)( 222221 tkStkS
0)(0)( 112111 tkStkS
0)(0)( 112111 tkStkS
tkX 1
tkX 22x
1x
1021 PM 28
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Introduction Introduction
Blind Source Separation Blind Source Separation
1021 PM
0
2120
1111 )()()()()(ll
ltslhltslhtx
0
2220
1212 )()()()()(ll
ltslhltslhtx
L
l
L
l
ltxlwltxlwty0
2120
1111 )()()()()(
L
l
L
l
ltxlwltxlwty0
2220
1212 )()()()()(
bull Mixing process
bull Unmixing process
ConvolutiveConvolutive
2
s1
s2
Introduction Introduction
Convolutive Blind Source Separation Convolutive Blind Source Separation Instantaneous Blind Source Separation Instantaneous Blind Source Separation
1021 PM 3
Introduction Introduction
Convolutive Blind Source Separation Convolutive Blind Source Separation Instantaneous Blind Source Separation Instantaneous Blind Source Separation
)(
)(
)(
)(
2
1
2221
1211
2
1
ts
ts
tx
tx
hh
hh
)()( tt SX H
)(
)(
)(
)(
2
1
2221
1211
2
1
ts
ts
hh
hh
tx
tx
)()( tt HSX
)(
)(
)()(
)()(
)(
)(
2
1
2221
1211
2
1
tfS
tfS
fHfH
fHfH
tfX
tfX
)()()( tfftf SX H
bull In frequency domain
Difficu
lt to
separa
te
Difficu
lt to
separa
te
Easy to
separa
te
Easy to
separa
te
1021 PM 4
Introduction Introduction
No of sources lt No of sensorNo of sources lt No of sensor1s
2s
3s
4s
1x2x3x
1s
2s1x2x3x
3s
1s
2s 1x2x3x
4s
No of sources = No of sensorNo of sources = No of sensor
No of sources gt No of sensorNo of sources gt No of sensor
Overdetermined mixingOverdetermined mixing
Determined mixingDetermined mixing
Underdetermined mixingUnderdetermined mixing
Difficult to se
parate
Difficult to se
parate
Easy to se
parate
Easy to se
parate
1021 PM 5
Approaches for BSS of Speech SignalsApproaches for BSS of Speech Signals
Types of mixing
Instantaneous mixing Convolutive mixing
1021 PM 6
Approaches for BSS of Speech SignalsApproaches for BSS of Speech Signals
Instantaneous mixing
Step 1 Selection of cost function
Step 2 Minimization or maximization of the cost function
1021 PM
WHS1
S2 X2
Y1
Y2
Separated
X1
7
Approaches for BSS of Speech SignalsApproaches for BSS of Speech SignalsInstantaneous mixingInstantaneous mixing
Selection of cost function
Statistical independence
Information theoretic
Non-Gaussianity
Kurtosis
Negentropy
Nonlinear cross moments
Temporal structure of speech
Non-stationarity of speech1021 PM
i
ii ypp y is idea Basic
Central limit theoremMixture of two or more sources will be more Gaussian than their individual components
224 E3E)( yyykurt
yyy HHJ gauss
yyyy dppH logEntropy
0 ji ygyfE
used becan statsticsorder Secondlags timedifferent for usly simultaneon correlatiooutput theedioganaliz eg
eddiagonalizusly simultaneo are matricesn correlatio the
and blocks into divided are Signals
Non Gaussianity measures
Signals from two different sources are independent
8
Approaches for BSS of Speech SignalsApproaches for BSS of Speech Signals
Instantaneous mixingMinimization or maximization of the cost function
simple gradient method
Natural gradient method
Newtonrsquos method
eg Informax ICA algorithm
eg FastICA
1021 PM 9
Approaches for BSS of Speech SignalsApproaches for BSS of Speech SignalsConvolutive Mixing
Time Domain Frequency Domain
QqltxlwyP
p
L
lpqpq 1)()(
1
1
0
)()()( tfftf SHX
)()()( tfftf XWY
AdvantageNo permutation problem
DisadvantageSlow convergence High computational cost for long filter taps
AdvantageLow computational costFast convergence
DisadvantagePermutation Problem
WHS1
S2
X1
X2
Y1 Y2
Y2 Y1
1021 PM 10
or
Permutation Problem in Frequency Domain BSS
Permutation Problem in Frequency Domain BSS
f1
f2
fk
x1x2
x3
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Still signals are mixed
K po
int
IFFT
Corresponding to different sources Due to permutation problem
One frequency binInstantaneous ICA algorithm
Solv
ing
perm
utati
on
Prob
lem y1
y2y3
Separated signals
Corresponding to y3
1021 PM 11
MotivationMotivation
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
12
My Contribution - IMy Contribution - I
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
13
Algorithm for Solving the Permutation Problem
Algorithm for Solving the Permutation Problem
f1
f2
fk
x1x2
x3
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Permutation problem
One frequency binInstantaneous ICA algorithm
Permutation problem solved
1021 PM 14
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Direction Of Arrival (DOA) method
2
1
)sin(2 1
p
dcfjkqpkq
pkefWfU Position of the pth sensor
Velocity of sound
)(
)(
)()(
)()(
)(
)(
2
1
2221
1211
2
1
k
k
kk
kk
k
k
fX
fX
fWfW
fWfW
fY
fY
1021 PM
Direction of y1 = -30o
Direction of y2 = 20o
15
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Reasons for failure at lower freq
Lower spacing causes error in phase difference measurement
The relation is approximated for plane wave front under anechoic condition
Disadvantages
Fails at lower frequenciesFails when sources are nearRoom reverberationSensor positions must be known
Direction Of Arrival (DOA) method
1021 PM 16
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
f1
f2
fk
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Low correlation
High correlationLow
correlation
x1x2
x3
Adjacent bands correlation method
1021 PM 17
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21r11 r12
r21 r22
s1
s2
Correlation matrix
21122211 rrrr No change
21122211 rrrr Change permutation
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM
21122211 and and rrrr
21122211 and and rrrr
With confidence Without confidence
2090
8010
8020
1090
2090
1040
1040
2090
Example Example
18
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21
r11 r12
r21 r22
s1
s2
Correlation matrix
Disadvantage The method is not robust
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM 19
1021 PM
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Combination of DOA and Correlation methods method
DOA + Harmonic Correlation + Adjacent bands correlation
Advantage Increased robustness
20
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
2y
1y
1x
2x
1s
2s
1021 PM 21
Time domain stage
Frequency domain stage
Partial separation method(Parallel configuration)
Partial separation method(Parallel configuration)
1021 PM 22
Time domain stage
Frequency domain stage
Parallel configuration
Partial separation method(Cascade configuration)
Partial separation method(Cascade configuration)
1021 PM 23
Time domain stage
Frequency domain stage
Advantages of Partial Separation methodAdvantages of Partial Separation method
bull Robustness
1021 PM 24
Comparison with Adjacent Bands Correlation Method
Comparison with Adjacent Bands Correlation Method
1021 PM 25
PS - Partial Separation method with confidence check C1 - Correlation between the adjacent bins without confidence check C2 - Correlation between adjacent bins with confidence check Ha - Correlation between the harmonic components with confidence check PS1 - Partial separation method alone without confidence check
1021 PM 26
Comparison with DOA methodComparison with DOA method
My Contribution -IIMy Contribution -II
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
27
Underdetermined Blind Source Separation of Instantaneous Mixtures
Underdetermined Blind Source Separation of Instantaneous Mixtures
t
k
0)(0)( 222221 tkStkS
0)(0)( 222221 tkStkS
0)(0)( 112111 tkStkS
0)(0)( 112111 tkStkS
tkX 1
tkX 22x
1x
1021 PM 28
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Introduction Introduction
Convolutive Blind Source Separation Convolutive Blind Source Separation Instantaneous Blind Source Separation Instantaneous Blind Source Separation
1021 PM 3
Introduction Introduction
Convolutive Blind Source Separation Convolutive Blind Source Separation Instantaneous Blind Source Separation Instantaneous Blind Source Separation
)(
)(
)(
)(
2
1
2221
1211
2
1
ts
ts
tx
tx
hh
hh
)()( tt SX H
)(
)(
)(
)(
2
1
2221
1211
2
1
ts
ts
hh
hh
tx
tx
)()( tt HSX
)(
)(
)()(
)()(
)(
)(
2
1
2221
1211
2
1
tfS
tfS
fHfH
fHfH
tfX
tfX
)()()( tfftf SX H
bull In frequency domain
Difficu
lt to
separa
te
Difficu
lt to
separa
te
Easy to
separa
te
Easy to
separa
te
1021 PM 4
Introduction Introduction
No of sources lt No of sensorNo of sources lt No of sensor1s
2s
3s
4s
1x2x3x
1s
2s1x2x3x
3s
1s
2s 1x2x3x
4s
No of sources = No of sensorNo of sources = No of sensor
No of sources gt No of sensorNo of sources gt No of sensor
Overdetermined mixingOverdetermined mixing
Determined mixingDetermined mixing
Underdetermined mixingUnderdetermined mixing
Difficult to se
parate
Difficult to se
parate
Easy to se
parate
Easy to se
parate
1021 PM 5
Approaches for BSS of Speech SignalsApproaches for BSS of Speech Signals
Types of mixing
Instantaneous mixing Convolutive mixing
1021 PM 6
Approaches for BSS of Speech SignalsApproaches for BSS of Speech Signals
Instantaneous mixing
Step 1 Selection of cost function
Step 2 Minimization or maximization of the cost function
1021 PM
WHS1
S2 X2
Y1
Y2
Separated
X1
7
Approaches for BSS of Speech SignalsApproaches for BSS of Speech SignalsInstantaneous mixingInstantaneous mixing
Selection of cost function
Statistical independence
Information theoretic
Non-Gaussianity
Kurtosis
Negentropy
Nonlinear cross moments
Temporal structure of speech
Non-stationarity of speech1021 PM
i
ii ypp y is idea Basic
Central limit theoremMixture of two or more sources will be more Gaussian than their individual components
224 E3E)( yyykurt
yyy HHJ gauss
yyyy dppH logEntropy
0 ji ygyfE
used becan statsticsorder Secondlags timedifferent for usly simultaneon correlatiooutput theedioganaliz eg
eddiagonalizusly simultaneo are matricesn correlatio the
and blocks into divided are Signals
Non Gaussianity measures
Signals from two different sources are independent
8
Approaches for BSS of Speech SignalsApproaches for BSS of Speech Signals
Instantaneous mixingMinimization or maximization of the cost function
simple gradient method
Natural gradient method
Newtonrsquos method
eg Informax ICA algorithm
eg FastICA
1021 PM 9
Approaches for BSS of Speech SignalsApproaches for BSS of Speech SignalsConvolutive Mixing
Time Domain Frequency Domain
QqltxlwyP
p
L
lpqpq 1)()(
1
1
0
)()()( tfftf SHX
)()()( tfftf XWY
AdvantageNo permutation problem
DisadvantageSlow convergence High computational cost for long filter taps
AdvantageLow computational costFast convergence
DisadvantagePermutation Problem
WHS1
S2
X1
X2
Y1 Y2
Y2 Y1
1021 PM 10
or
Permutation Problem in Frequency Domain BSS
Permutation Problem in Frequency Domain BSS
f1
f2
fk
x1x2
x3
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Still signals are mixed
K po
int
IFFT
Corresponding to different sources Due to permutation problem
One frequency binInstantaneous ICA algorithm
Solv
ing
perm
utati
on
Prob
lem y1
y2y3
Separated signals
Corresponding to y3
1021 PM 11
MotivationMotivation
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
12
My Contribution - IMy Contribution - I
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
13
Algorithm for Solving the Permutation Problem
Algorithm for Solving the Permutation Problem
f1
f2
fk
x1x2
x3
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Permutation problem
One frequency binInstantaneous ICA algorithm
Permutation problem solved
1021 PM 14
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Direction Of Arrival (DOA) method
2
1
)sin(2 1
p
dcfjkqpkq
pkefWfU Position of the pth sensor
Velocity of sound
)(
)(
)()(
)()(
)(
)(
2
1
2221
1211
2
1
k
k
kk
kk
k
k
fX
fX
fWfW
fWfW
fY
fY
1021 PM
Direction of y1 = -30o
Direction of y2 = 20o
15
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Reasons for failure at lower freq
Lower spacing causes error in phase difference measurement
The relation is approximated for plane wave front under anechoic condition
Disadvantages
Fails at lower frequenciesFails when sources are nearRoom reverberationSensor positions must be known
Direction Of Arrival (DOA) method
1021 PM 16
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
f1
f2
fk
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Low correlation
High correlationLow
correlation
x1x2
x3
Adjacent bands correlation method
1021 PM 17
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21r11 r12
r21 r22
s1
s2
Correlation matrix
21122211 rrrr No change
21122211 rrrr Change permutation
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM
21122211 and and rrrr
21122211 and and rrrr
With confidence Without confidence
2090
8010
8020
1090
2090
1040
1040
2090
Example Example
18
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21
r11 r12
r21 r22
s1
s2
Correlation matrix
Disadvantage The method is not robust
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM 19
1021 PM
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Combination of DOA and Correlation methods method
DOA + Harmonic Correlation + Adjacent bands correlation
Advantage Increased robustness
20
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
2y
1y
1x
2x
1s
2s
1021 PM 21
Time domain stage
Frequency domain stage
Partial separation method(Parallel configuration)
Partial separation method(Parallel configuration)
1021 PM 22
Time domain stage
Frequency domain stage
Parallel configuration
Partial separation method(Cascade configuration)
Partial separation method(Cascade configuration)
1021 PM 23
Time domain stage
Frequency domain stage
Advantages of Partial Separation methodAdvantages of Partial Separation method
bull Robustness
1021 PM 24
Comparison with Adjacent Bands Correlation Method
Comparison with Adjacent Bands Correlation Method
1021 PM 25
PS - Partial Separation method with confidence check C1 - Correlation between the adjacent bins without confidence check C2 - Correlation between adjacent bins with confidence check Ha - Correlation between the harmonic components with confidence check PS1 - Partial separation method alone without confidence check
1021 PM 26
Comparison with DOA methodComparison with DOA method
My Contribution -IIMy Contribution -II
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
27
Underdetermined Blind Source Separation of Instantaneous Mixtures
Underdetermined Blind Source Separation of Instantaneous Mixtures
t
k
0)(0)( 222221 tkStkS
0)(0)( 222221 tkStkS
0)(0)( 112111 tkStkS
0)(0)( 112111 tkStkS
tkX 1
tkX 22x
1x
1021 PM 28
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Introduction Introduction
Convolutive Blind Source Separation Convolutive Blind Source Separation Instantaneous Blind Source Separation Instantaneous Blind Source Separation
)(
)(
)(
)(
2
1
2221
1211
2
1
ts
ts
tx
tx
hh
hh
)()( tt SX H
)(
)(
)(
)(
2
1
2221
1211
2
1
ts
ts
hh
hh
tx
tx
)()( tt HSX
)(
)(
)()(
)()(
)(
)(
2
1
2221
1211
2
1
tfS
tfS
fHfH
fHfH
tfX
tfX
)()()( tfftf SX H
bull In frequency domain
Difficu
lt to
separa
te
Difficu
lt to
separa
te
Easy to
separa
te
Easy to
separa
te
1021 PM 4
Introduction Introduction
No of sources lt No of sensorNo of sources lt No of sensor1s
2s
3s
4s
1x2x3x
1s
2s1x2x3x
3s
1s
2s 1x2x3x
4s
No of sources = No of sensorNo of sources = No of sensor
No of sources gt No of sensorNo of sources gt No of sensor
Overdetermined mixingOverdetermined mixing
Determined mixingDetermined mixing
Underdetermined mixingUnderdetermined mixing
Difficult to se
parate
Difficult to se
parate
Easy to se
parate
Easy to se
parate
1021 PM 5
Approaches for BSS of Speech SignalsApproaches for BSS of Speech Signals
Types of mixing
Instantaneous mixing Convolutive mixing
1021 PM 6
Approaches for BSS of Speech SignalsApproaches for BSS of Speech Signals
Instantaneous mixing
Step 1 Selection of cost function
Step 2 Minimization or maximization of the cost function
1021 PM
WHS1
S2 X2
Y1
Y2
Separated
X1
7
Approaches for BSS of Speech SignalsApproaches for BSS of Speech SignalsInstantaneous mixingInstantaneous mixing
Selection of cost function
Statistical independence
Information theoretic
Non-Gaussianity
Kurtosis
Negentropy
Nonlinear cross moments
Temporal structure of speech
Non-stationarity of speech1021 PM
i
ii ypp y is idea Basic
Central limit theoremMixture of two or more sources will be more Gaussian than their individual components
224 E3E)( yyykurt
yyy HHJ gauss
yyyy dppH logEntropy
0 ji ygyfE
used becan statsticsorder Secondlags timedifferent for usly simultaneon correlatiooutput theedioganaliz eg
eddiagonalizusly simultaneo are matricesn correlatio the
and blocks into divided are Signals
Non Gaussianity measures
Signals from two different sources are independent
8
Approaches for BSS of Speech SignalsApproaches for BSS of Speech Signals
Instantaneous mixingMinimization or maximization of the cost function
simple gradient method
Natural gradient method
Newtonrsquos method
eg Informax ICA algorithm
eg FastICA
1021 PM 9
Approaches for BSS of Speech SignalsApproaches for BSS of Speech SignalsConvolutive Mixing
Time Domain Frequency Domain
QqltxlwyP
p
L
lpqpq 1)()(
1
1
0
)()()( tfftf SHX
)()()( tfftf XWY
AdvantageNo permutation problem
DisadvantageSlow convergence High computational cost for long filter taps
AdvantageLow computational costFast convergence
DisadvantagePermutation Problem
WHS1
S2
X1
X2
Y1 Y2
Y2 Y1
1021 PM 10
or
Permutation Problem in Frequency Domain BSS
Permutation Problem in Frequency Domain BSS
f1
f2
fk
x1x2
x3
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Still signals are mixed
K po
int
IFFT
Corresponding to different sources Due to permutation problem
One frequency binInstantaneous ICA algorithm
Solv
ing
perm
utati
on
Prob
lem y1
y2y3
Separated signals
Corresponding to y3
1021 PM 11
MotivationMotivation
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
12
My Contribution - IMy Contribution - I
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
13
Algorithm for Solving the Permutation Problem
Algorithm for Solving the Permutation Problem
f1
f2
fk
x1x2
x3
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Permutation problem
One frequency binInstantaneous ICA algorithm
Permutation problem solved
1021 PM 14
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Direction Of Arrival (DOA) method
2
1
)sin(2 1
p
dcfjkqpkq
pkefWfU Position of the pth sensor
Velocity of sound
)(
)(
)()(
)()(
)(
)(
2
1
2221
1211
2
1
k
k
kk
kk
k
k
fX
fX
fWfW
fWfW
fY
fY
1021 PM
Direction of y1 = -30o
Direction of y2 = 20o
15
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Reasons for failure at lower freq
Lower spacing causes error in phase difference measurement
The relation is approximated for plane wave front under anechoic condition
Disadvantages
Fails at lower frequenciesFails when sources are nearRoom reverberationSensor positions must be known
Direction Of Arrival (DOA) method
1021 PM 16
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
f1
f2
fk
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Low correlation
High correlationLow
correlation
x1x2
x3
Adjacent bands correlation method
1021 PM 17
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21r11 r12
r21 r22
s1
s2
Correlation matrix
21122211 rrrr No change
21122211 rrrr Change permutation
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM
21122211 and and rrrr
21122211 and and rrrr
With confidence Without confidence
2090
8010
8020
1090
2090
1040
1040
2090
Example Example
18
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21
r11 r12
r21 r22
s1
s2
Correlation matrix
Disadvantage The method is not robust
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM 19
1021 PM
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Combination of DOA and Correlation methods method
DOA + Harmonic Correlation + Adjacent bands correlation
Advantage Increased robustness
20
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
2y
1y
1x
2x
1s
2s
1021 PM 21
Time domain stage
Frequency domain stage
Partial separation method(Parallel configuration)
Partial separation method(Parallel configuration)
1021 PM 22
Time domain stage
Frequency domain stage
Parallel configuration
Partial separation method(Cascade configuration)
Partial separation method(Cascade configuration)
1021 PM 23
Time domain stage
Frequency domain stage
Advantages of Partial Separation methodAdvantages of Partial Separation method
bull Robustness
1021 PM 24
Comparison with Adjacent Bands Correlation Method
Comparison with Adjacent Bands Correlation Method
1021 PM 25
PS - Partial Separation method with confidence check C1 - Correlation between the adjacent bins without confidence check C2 - Correlation between adjacent bins with confidence check Ha - Correlation between the harmonic components with confidence check PS1 - Partial separation method alone without confidence check
1021 PM 26
Comparison with DOA methodComparison with DOA method
My Contribution -IIMy Contribution -II
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
27
Underdetermined Blind Source Separation of Instantaneous Mixtures
Underdetermined Blind Source Separation of Instantaneous Mixtures
t
k
0)(0)( 222221 tkStkS
0)(0)( 222221 tkStkS
0)(0)( 112111 tkStkS
0)(0)( 112111 tkStkS
tkX 1
tkX 22x
1x
1021 PM 28
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Introduction Introduction
No of sources lt No of sensorNo of sources lt No of sensor1s
2s
3s
4s
1x2x3x
1s
2s1x2x3x
3s
1s
2s 1x2x3x
4s
No of sources = No of sensorNo of sources = No of sensor
No of sources gt No of sensorNo of sources gt No of sensor
Overdetermined mixingOverdetermined mixing
Determined mixingDetermined mixing
Underdetermined mixingUnderdetermined mixing
Difficult to se
parate
Difficult to se
parate
Easy to se
parate
Easy to se
parate
1021 PM 5
Approaches for BSS of Speech SignalsApproaches for BSS of Speech Signals
Types of mixing
Instantaneous mixing Convolutive mixing
1021 PM 6
Approaches for BSS of Speech SignalsApproaches for BSS of Speech Signals
Instantaneous mixing
Step 1 Selection of cost function
Step 2 Minimization or maximization of the cost function
1021 PM
WHS1
S2 X2
Y1
Y2
Separated
X1
7
Approaches for BSS of Speech SignalsApproaches for BSS of Speech SignalsInstantaneous mixingInstantaneous mixing
Selection of cost function
Statistical independence
Information theoretic
Non-Gaussianity
Kurtosis
Negentropy
Nonlinear cross moments
Temporal structure of speech
Non-stationarity of speech1021 PM
i
ii ypp y is idea Basic
Central limit theoremMixture of two or more sources will be more Gaussian than their individual components
224 E3E)( yyykurt
yyy HHJ gauss
yyyy dppH logEntropy
0 ji ygyfE
used becan statsticsorder Secondlags timedifferent for usly simultaneon correlatiooutput theedioganaliz eg
eddiagonalizusly simultaneo are matricesn correlatio the
and blocks into divided are Signals
Non Gaussianity measures
Signals from two different sources are independent
8
Approaches for BSS of Speech SignalsApproaches for BSS of Speech Signals
Instantaneous mixingMinimization or maximization of the cost function
simple gradient method
Natural gradient method
Newtonrsquos method
eg Informax ICA algorithm
eg FastICA
1021 PM 9
Approaches for BSS of Speech SignalsApproaches for BSS of Speech SignalsConvolutive Mixing
Time Domain Frequency Domain
QqltxlwyP
p
L
lpqpq 1)()(
1
1
0
)()()( tfftf SHX
)()()( tfftf XWY
AdvantageNo permutation problem
DisadvantageSlow convergence High computational cost for long filter taps
AdvantageLow computational costFast convergence
DisadvantagePermutation Problem
WHS1
S2
X1
X2
Y1 Y2
Y2 Y1
1021 PM 10
or
Permutation Problem in Frequency Domain BSS
Permutation Problem in Frequency Domain BSS
f1
f2
fk
x1x2
x3
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Still signals are mixed
K po
int
IFFT
Corresponding to different sources Due to permutation problem
One frequency binInstantaneous ICA algorithm
Solv
ing
perm
utati
on
Prob
lem y1
y2y3
Separated signals
Corresponding to y3
1021 PM 11
MotivationMotivation
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
12
My Contribution - IMy Contribution - I
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
13
Algorithm for Solving the Permutation Problem
Algorithm for Solving the Permutation Problem
f1
f2
fk
x1x2
x3
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Permutation problem
One frequency binInstantaneous ICA algorithm
Permutation problem solved
1021 PM 14
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Direction Of Arrival (DOA) method
2
1
)sin(2 1
p
dcfjkqpkq
pkefWfU Position of the pth sensor
Velocity of sound
)(
)(
)()(
)()(
)(
)(
2
1
2221
1211
2
1
k
k
kk
kk
k
k
fX
fX
fWfW
fWfW
fY
fY
1021 PM
Direction of y1 = -30o
Direction of y2 = 20o
15
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Reasons for failure at lower freq
Lower spacing causes error in phase difference measurement
The relation is approximated for plane wave front under anechoic condition
Disadvantages
Fails at lower frequenciesFails when sources are nearRoom reverberationSensor positions must be known
Direction Of Arrival (DOA) method
1021 PM 16
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
f1
f2
fk
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Low correlation
High correlationLow
correlation
x1x2
x3
Adjacent bands correlation method
1021 PM 17
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21r11 r12
r21 r22
s1
s2
Correlation matrix
21122211 rrrr No change
21122211 rrrr Change permutation
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM
21122211 and and rrrr
21122211 and and rrrr
With confidence Without confidence
2090
8010
8020
1090
2090
1040
1040
2090
Example Example
18
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21
r11 r12
r21 r22
s1
s2
Correlation matrix
Disadvantage The method is not robust
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM 19
1021 PM
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Combination of DOA and Correlation methods method
DOA + Harmonic Correlation + Adjacent bands correlation
Advantage Increased robustness
20
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
2y
1y
1x
2x
1s
2s
1021 PM 21
Time domain stage
Frequency domain stage
Partial separation method(Parallel configuration)
Partial separation method(Parallel configuration)
1021 PM 22
Time domain stage
Frequency domain stage
Parallel configuration
Partial separation method(Cascade configuration)
Partial separation method(Cascade configuration)
1021 PM 23
Time domain stage
Frequency domain stage
Advantages of Partial Separation methodAdvantages of Partial Separation method
bull Robustness
1021 PM 24
Comparison with Adjacent Bands Correlation Method
Comparison with Adjacent Bands Correlation Method
1021 PM 25
PS - Partial Separation method with confidence check C1 - Correlation between the adjacent bins without confidence check C2 - Correlation between adjacent bins with confidence check Ha - Correlation between the harmonic components with confidence check PS1 - Partial separation method alone without confidence check
1021 PM 26
Comparison with DOA methodComparison with DOA method
My Contribution -IIMy Contribution -II
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
27
Underdetermined Blind Source Separation of Instantaneous Mixtures
Underdetermined Blind Source Separation of Instantaneous Mixtures
t
k
0)(0)( 222221 tkStkS
0)(0)( 222221 tkStkS
0)(0)( 112111 tkStkS
0)(0)( 112111 tkStkS
tkX 1
tkX 22x
1x
1021 PM 28
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Approaches for BSS of Speech SignalsApproaches for BSS of Speech Signals
Types of mixing
Instantaneous mixing Convolutive mixing
1021 PM 6
Approaches for BSS of Speech SignalsApproaches for BSS of Speech Signals
Instantaneous mixing
Step 1 Selection of cost function
Step 2 Minimization or maximization of the cost function
1021 PM
WHS1
S2 X2
Y1
Y2
Separated
X1
7
Approaches for BSS of Speech SignalsApproaches for BSS of Speech SignalsInstantaneous mixingInstantaneous mixing
Selection of cost function
Statistical independence
Information theoretic
Non-Gaussianity
Kurtosis
Negentropy
Nonlinear cross moments
Temporal structure of speech
Non-stationarity of speech1021 PM
i
ii ypp y is idea Basic
Central limit theoremMixture of two or more sources will be more Gaussian than their individual components
224 E3E)( yyykurt
yyy HHJ gauss
yyyy dppH logEntropy
0 ji ygyfE
used becan statsticsorder Secondlags timedifferent for usly simultaneon correlatiooutput theedioganaliz eg
eddiagonalizusly simultaneo are matricesn correlatio the
and blocks into divided are Signals
Non Gaussianity measures
Signals from two different sources are independent
8
Approaches for BSS of Speech SignalsApproaches for BSS of Speech Signals
Instantaneous mixingMinimization or maximization of the cost function
simple gradient method
Natural gradient method
Newtonrsquos method
eg Informax ICA algorithm
eg FastICA
1021 PM 9
Approaches for BSS of Speech SignalsApproaches for BSS of Speech SignalsConvolutive Mixing
Time Domain Frequency Domain
QqltxlwyP
p
L
lpqpq 1)()(
1
1
0
)()()( tfftf SHX
)()()( tfftf XWY
AdvantageNo permutation problem
DisadvantageSlow convergence High computational cost for long filter taps
AdvantageLow computational costFast convergence
DisadvantagePermutation Problem
WHS1
S2
X1
X2
Y1 Y2
Y2 Y1
1021 PM 10
or
Permutation Problem in Frequency Domain BSS
Permutation Problem in Frequency Domain BSS
f1
f2
fk
x1x2
x3
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Still signals are mixed
K po
int
IFFT
Corresponding to different sources Due to permutation problem
One frequency binInstantaneous ICA algorithm
Solv
ing
perm
utati
on
Prob
lem y1
y2y3
Separated signals
Corresponding to y3
1021 PM 11
MotivationMotivation
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
12
My Contribution - IMy Contribution - I
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
13
Algorithm for Solving the Permutation Problem
Algorithm for Solving the Permutation Problem
f1
f2
fk
x1x2
x3
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Permutation problem
One frequency binInstantaneous ICA algorithm
Permutation problem solved
1021 PM 14
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Direction Of Arrival (DOA) method
2
1
)sin(2 1
p
dcfjkqpkq
pkefWfU Position of the pth sensor
Velocity of sound
)(
)(
)()(
)()(
)(
)(
2
1
2221
1211
2
1
k
k
kk
kk
k
k
fX
fX
fWfW
fWfW
fY
fY
1021 PM
Direction of y1 = -30o
Direction of y2 = 20o
15
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Reasons for failure at lower freq
Lower spacing causes error in phase difference measurement
The relation is approximated for plane wave front under anechoic condition
Disadvantages
Fails at lower frequenciesFails when sources are nearRoom reverberationSensor positions must be known
Direction Of Arrival (DOA) method
1021 PM 16
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
f1
f2
fk
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Low correlation
High correlationLow
correlation
x1x2
x3
Adjacent bands correlation method
1021 PM 17
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21r11 r12
r21 r22
s1
s2
Correlation matrix
21122211 rrrr No change
21122211 rrrr Change permutation
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM
21122211 and and rrrr
21122211 and and rrrr
With confidence Without confidence
2090
8010
8020
1090
2090
1040
1040
2090
Example Example
18
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21
r11 r12
r21 r22
s1
s2
Correlation matrix
Disadvantage The method is not robust
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM 19
1021 PM
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Combination of DOA and Correlation methods method
DOA + Harmonic Correlation + Adjacent bands correlation
Advantage Increased robustness
20
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
2y
1y
1x
2x
1s
2s
1021 PM 21
Time domain stage
Frequency domain stage
Partial separation method(Parallel configuration)
Partial separation method(Parallel configuration)
1021 PM 22
Time domain stage
Frequency domain stage
Parallel configuration
Partial separation method(Cascade configuration)
Partial separation method(Cascade configuration)
1021 PM 23
Time domain stage
Frequency domain stage
Advantages of Partial Separation methodAdvantages of Partial Separation method
bull Robustness
1021 PM 24
Comparison with Adjacent Bands Correlation Method
Comparison with Adjacent Bands Correlation Method
1021 PM 25
PS - Partial Separation method with confidence check C1 - Correlation between the adjacent bins without confidence check C2 - Correlation between adjacent bins with confidence check Ha - Correlation between the harmonic components with confidence check PS1 - Partial separation method alone without confidence check
1021 PM 26
Comparison with DOA methodComparison with DOA method
My Contribution -IIMy Contribution -II
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
27
Underdetermined Blind Source Separation of Instantaneous Mixtures
Underdetermined Blind Source Separation of Instantaneous Mixtures
t
k
0)(0)( 222221 tkStkS
0)(0)( 222221 tkStkS
0)(0)( 112111 tkStkS
0)(0)( 112111 tkStkS
tkX 1
tkX 22x
1x
1021 PM 28
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Approaches for BSS of Speech SignalsApproaches for BSS of Speech Signals
Instantaneous mixing
Step 1 Selection of cost function
Step 2 Minimization or maximization of the cost function
1021 PM
WHS1
S2 X2
Y1
Y2
Separated
X1
7
Approaches for BSS of Speech SignalsApproaches for BSS of Speech SignalsInstantaneous mixingInstantaneous mixing
Selection of cost function
Statistical independence
Information theoretic
Non-Gaussianity
Kurtosis
Negentropy
Nonlinear cross moments
Temporal structure of speech
Non-stationarity of speech1021 PM
i
ii ypp y is idea Basic
Central limit theoremMixture of two or more sources will be more Gaussian than their individual components
224 E3E)( yyykurt
yyy HHJ gauss
yyyy dppH logEntropy
0 ji ygyfE
used becan statsticsorder Secondlags timedifferent for usly simultaneon correlatiooutput theedioganaliz eg
eddiagonalizusly simultaneo are matricesn correlatio the
and blocks into divided are Signals
Non Gaussianity measures
Signals from two different sources are independent
8
Approaches for BSS of Speech SignalsApproaches for BSS of Speech Signals
Instantaneous mixingMinimization or maximization of the cost function
simple gradient method
Natural gradient method
Newtonrsquos method
eg Informax ICA algorithm
eg FastICA
1021 PM 9
Approaches for BSS of Speech SignalsApproaches for BSS of Speech SignalsConvolutive Mixing
Time Domain Frequency Domain
QqltxlwyP
p
L
lpqpq 1)()(
1
1
0
)()()( tfftf SHX
)()()( tfftf XWY
AdvantageNo permutation problem
DisadvantageSlow convergence High computational cost for long filter taps
AdvantageLow computational costFast convergence
DisadvantagePermutation Problem
WHS1
S2
X1
X2
Y1 Y2
Y2 Y1
1021 PM 10
or
Permutation Problem in Frequency Domain BSS
Permutation Problem in Frequency Domain BSS
f1
f2
fk
x1x2
x3
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Still signals are mixed
K po
int
IFFT
Corresponding to different sources Due to permutation problem
One frequency binInstantaneous ICA algorithm
Solv
ing
perm
utati
on
Prob
lem y1
y2y3
Separated signals
Corresponding to y3
1021 PM 11
MotivationMotivation
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
12
My Contribution - IMy Contribution - I
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
13
Algorithm for Solving the Permutation Problem
Algorithm for Solving the Permutation Problem
f1
f2
fk
x1x2
x3
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Permutation problem
One frequency binInstantaneous ICA algorithm
Permutation problem solved
1021 PM 14
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Direction Of Arrival (DOA) method
2
1
)sin(2 1
p
dcfjkqpkq
pkefWfU Position of the pth sensor
Velocity of sound
)(
)(
)()(
)()(
)(
)(
2
1
2221
1211
2
1
k
k
kk
kk
k
k
fX
fX
fWfW
fWfW
fY
fY
1021 PM
Direction of y1 = -30o
Direction of y2 = 20o
15
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Reasons for failure at lower freq
Lower spacing causes error in phase difference measurement
The relation is approximated for plane wave front under anechoic condition
Disadvantages
Fails at lower frequenciesFails when sources are nearRoom reverberationSensor positions must be known
Direction Of Arrival (DOA) method
1021 PM 16
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
f1
f2
fk
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Low correlation
High correlationLow
correlation
x1x2
x3
Adjacent bands correlation method
1021 PM 17
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21r11 r12
r21 r22
s1
s2
Correlation matrix
21122211 rrrr No change
21122211 rrrr Change permutation
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM
21122211 and and rrrr
21122211 and and rrrr
With confidence Without confidence
2090
8010
8020
1090
2090
1040
1040
2090
Example Example
18
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21
r11 r12
r21 r22
s1
s2
Correlation matrix
Disadvantage The method is not robust
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM 19
1021 PM
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Combination of DOA and Correlation methods method
DOA + Harmonic Correlation + Adjacent bands correlation
Advantage Increased robustness
20
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
2y
1y
1x
2x
1s
2s
1021 PM 21
Time domain stage
Frequency domain stage
Partial separation method(Parallel configuration)
Partial separation method(Parallel configuration)
1021 PM 22
Time domain stage
Frequency domain stage
Parallel configuration
Partial separation method(Cascade configuration)
Partial separation method(Cascade configuration)
1021 PM 23
Time domain stage
Frequency domain stage
Advantages of Partial Separation methodAdvantages of Partial Separation method
bull Robustness
1021 PM 24
Comparison with Adjacent Bands Correlation Method
Comparison with Adjacent Bands Correlation Method
1021 PM 25
PS - Partial Separation method with confidence check C1 - Correlation between the adjacent bins without confidence check C2 - Correlation between adjacent bins with confidence check Ha - Correlation between the harmonic components with confidence check PS1 - Partial separation method alone without confidence check
1021 PM 26
Comparison with DOA methodComparison with DOA method
My Contribution -IIMy Contribution -II
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
27
Underdetermined Blind Source Separation of Instantaneous Mixtures
Underdetermined Blind Source Separation of Instantaneous Mixtures
t
k
0)(0)( 222221 tkStkS
0)(0)( 222221 tkStkS
0)(0)( 112111 tkStkS
0)(0)( 112111 tkStkS
tkX 1
tkX 22x
1x
1021 PM 28
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Approaches for BSS of Speech SignalsApproaches for BSS of Speech SignalsInstantaneous mixingInstantaneous mixing
Selection of cost function
Statistical independence
Information theoretic
Non-Gaussianity
Kurtosis
Negentropy
Nonlinear cross moments
Temporal structure of speech
Non-stationarity of speech1021 PM
i
ii ypp y is idea Basic
Central limit theoremMixture of two or more sources will be more Gaussian than their individual components
224 E3E)( yyykurt
yyy HHJ gauss
yyyy dppH logEntropy
0 ji ygyfE
used becan statsticsorder Secondlags timedifferent for usly simultaneon correlatiooutput theedioganaliz eg
eddiagonalizusly simultaneo are matricesn correlatio the
and blocks into divided are Signals
Non Gaussianity measures
Signals from two different sources are independent
8
Approaches for BSS of Speech SignalsApproaches for BSS of Speech Signals
Instantaneous mixingMinimization or maximization of the cost function
simple gradient method
Natural gradient method
Newtonrsquos method
eg Informax ICA algorithm
eg FastICA
1021 PM 9
Approaches for BSS of Speech SignalsApproaches for BSS of Speech SignalsConvolutive Mixing
Time Domain Frequency Domain
QqltxlwyP
p
L
lpqpq 1)()(
1
1
0
)()()( tfftf SHX
)()()( tfftf XWY
AdvantageNo permutation problem
DisadvantageSlow convergence High computational cost for long filter taps
AdvantageLow computational costFast convergence
DisadvantagePermutation Problem
WHS1
S2
X1
X2
Y1 Y2
Y2 Y1
1021 PM 10
or
Permutation Problem in Frequency Domain BSS
Permutation Problem in Frequency Domain BSS
f1
f2
fk
x1x2
x3
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Still signals are mixed
K po
int
IFFT
Corresponding to different sources Due to permutation problem
One frequency binInstantaneous ICA algorithm
Solv
ing
perm
utati
on
Prob
lem y1
y2y3
Separated signals
Corresponding to y3
1021 PM 11
MotivationMotivation
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
12
My Contribution - IMy Contribution - I
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
13
Algorithm for Solving the Permutation Problem
Algorithm for Solving the Permutation Problem
f1
f2
fk
x1x2
x3
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Permutation problem
One frequency binInstantaneous ICA algorithm
Permutation problem solved
1021 PM 14
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Direction Of Arrival (DOA) method
2
1
)sin(2 1
p
dcfjkqpkq
pkefWfU Position of the pth sensor
Velocity of sound
)(
)(
)()(
)()(
)(
)(
2
1
2221
1211
2
1
k
k
kk
kk
k
k
fX
fX
fWfW
fWfW
fY
fY
1021 PM
Direction of y1 = -30o
Direction of y2 = 20o
15
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Reasons for failure at lower freq
Lower spacing causes error in phase difference measurement
The relation is approximated for plane wave front under anechoic condition
Disadvantages
Fails at lower frequenciesFails when sources are nearRoom reverberationSensor positions must be known
Direction Of Arrival (DOA) method
1021 PM 16
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
f1
f2
fk
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Low correlation
High correlationLow
correlation
x1x2
x3
Adjacent bands correlation method
1021 PM 17
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21r11 r12
r21 r22
s1
s2
Correlation matrix
21122211 rrrr No change
21122211 rrrr Change permutation
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM
21122211 and and rrrr
21122211 and and rrrr
With confidence Without confidence
2090
8010
8020
1090
2090
1040
1040
2090
Example Example
18
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21
r11 r12
r21 r22
s1
s2
Correlation matrix
Disadvantage The method is not robust
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM 19
1021 PM
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Combination of DOA and Correlation methods method
DOA + Harmonic Correlation + Adjacent bands correlation
Advantage Increased robustness
20
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
2y
1y
1x
2x
1s
2s
1021 PM 21
Time domain stage
Frequency domain stage
Partial separation method(Parallel configuration)
Partial separation method(Parallel configuration)
1021 PM 22
Time domain stage
Frequency domain stage
Parallel configuration
Partial separation method(Cascade configuration)
Partial separation method(Cascade configuration)
1021 PM 23
Time domain stage
Frequency domain stage
Advantages of Partial Separation methodAdvantages of Partial Separation method
bull Robustness
1021 PM 24
Comparison with Adjacent Bands Correlation Method
Comparison with Adjacent Bands Correlation Method
1021 PM 25
PS - Partial Separation method with confidence check C1 - Correlation between the adjacent bins without confidence check C2 - Correlation between adjacent bins with confidence check Ha - Correlation between the harmonic components with confidence check PS1 - Partial separation method alone without confidence check
1021 PM 26
Comparison with DOA methodComparison with DOA method
My Contribution -IIMy Contribution -II
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
27
Underdetermined Blind Source Separation of Instantaneous Mixtures
Underdetermined Blind Source Separation of Instantaneous Mixtures
t
k
0)(0)( 222221 tkStkS
0)(0)( 222221 tkStkS
0)(0)( 112111 tkStkS
0)(0)( 112111 tkStkS
tkX 1
tkX 22x
1x
1021 PM 28
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Approaches for BSS of Speech SignalsApproaches for BSS of Speech Signals
Instantaneous mixingMinimization or maximization of the cost function
simple gradient method
Natural gradient method
Newtonrsquos method
eg Informax ICA algorithm
eg FastICA
1021 PM 9
Approaches for BSS of Speech SignalsApproaches for BSS of Speech SignalsConvolutive Mixing
Time Domain Frequency Domain
QqltxlwyP
p
L
lpqpq 1)()(
1
1
0
)()()( tfftf SHX
)()()( tfftf XWY
AdvantageNo permutation problem
DisadvantageSlow convergence High computational cost for long filter taps
AdvantageLow computational costFast convergence
DisadvantagePermutation Problem
WHS1
S2
X1
X2
Y1 Y2
Y2 Y1
1021 PM 10
or
Permutation Problem in Frequency Domain BSS
Permutation Problem in Frequency Domain BSS
f1
f2
fk
x1x2
x3
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Still signals are mixed
K po
int
IFFT
Corresponding to different sources Due to permutation problem
One frequency binInstantaneous ICA algorithm
Solv
ing
perm
utati
on
Prob
lem y1
y2y3
Separated signals
Corresponding to y3
1021 PM 11
MotivationMotivation
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
12
My Contribution - IMy Contribution - I
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
13
Algorithm for Solving the Permutation Problem
Algorithm for Solving the Permutation Problem
f1
f2
fk
x1x2
x3
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Permutation problem
One frequency binInstantaneous ICA algorithm
Permutation problem solved
1021 PM 14
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Direction Of Arrival (DOA) method
2
1
)sin(2 1
p
dcfjkqpkq
pkefWfU Position of the pth sensor
Velocity of sound
)(
)(
)()(
)()(
)(
)(
2
1
2221
1211
2
1
k
k
kk
kk
k
k
fX
fX
fWfW
fWfW
fY
fY
1021 PM
Direction of y1 = -30o
Direction of y2 = 20o
15
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Reasons for failure at lower freq
Lower spacing causes error in phase difference measurement
The relation is approximated for plane wave front under anechoic condition
Disadvantages
Fails at lower frequenciesFails when sources are nearRoom reverberationSensor positions must be known
Direction Of Arrival (DOA) method
1021 PM 16
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
f1
f2
fk
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Low correlation
High correlationLow
correlation
x1x2
x3
Adjacent bands correlation method
1021 PM 17
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21r11 r12
r21 r22
s1
s2
Correlation matrix
21122211 rrrr No change
21122211 rrrr Change permutation
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM
21122211 and and rrrr
21122211 and and rrrr
With confidence Without confidence
2090
8010
8020
1090
2090
1040
1040
2090
Example Example
18
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21
r11 r12
r21 r22
s1
s2
Correlation matrix
Disadvantage The method is not robust
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM 19
1021 PM
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Combination of DOA and Correlation methods method
DOA + Harmonic Correlation + Adjacent bands correlation
Advantage Increased robustness
20
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
2y
1y
1x
2x
1s
2s
1021 PM 21
Time domain stage
Frequency domain stage
Partial separation method(Parallel configuration)
Partial separation method(Parallel configuration)
1021 PM 22
Time domain stage
Frequency domain stage
Parallel configuration
Partial separation method(Cascade configuration)
Partial separation method(Cascade configuration)
1021 PM 23
Time domain stage
Frequency domain stage
Advantages of Partial Separation methodAdvantages of Partial Separation method
bull Robustness
1021 PM 24
Comparison with Adjacent Bands Correlation Method
Comparison with Adjacent Bands Correlation Method
1021 PM 25
PS - Partial Separation method with confidence check C1 - Correlation between the adjacent bins without confidence check C2 - Correlation between adjacent bins with confidence check Ha - Correlation between the harmonic components with confidence check PS1 - Partial separation method alone without confidence check
1021 PM 26
Comparison with DOA methodComparison with DOA method
My Contribution -IIMy Contribution -II
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
27
Underdetermined Blind Source Separation of Instantaneous Mixtures
Underdetermined Blind Source Separation of Instantaneous Mixtures
t
k
0)(0)( 222221 tkStkS
0)(0)( 222221 tkStkS
0)(0)( 112111 tkStkS
0)(0)( 112111 tkStkS
tkX 1
tkX 22x
1x
1021 PM 28
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Approaches for BSS of Speech SignalsApproaches for BSS of Speech SignalsConvolutive Mixing
Time Domain Frequency Domain
QqltxlwyP
p
L
lpqpq 1)()(
1
1
0
)()()( tfftf SHX
)()()( tfftf XWY
AdvantageNo permutation problem
DisadvantageSlow convergence High computational cost for long filter taps
AdvantageLow computational costFast convergence
DisadvantagePermutation Problem
WHS1
S2
X1
X2
Y1 Y2
Y2 Y1
1021 PM 10
or
Permutation Problem in Frequency Domain BSS
Permutation Problem in Frequency Domain BSS
f1
f2
fk
x1x2
x3
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Still signals are mixed
K po
int
IFFT
Corresponding to different sources Due to permutation problem
One frequency binInstantaneous ICA algorithm
Solv
ing
perm
utati
on
Prob
lem y1
y2y3
Separated signals
Corresponding to y3
1021 PM 11
MotivationMotivation
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
12
My Contribution - IMy Contribution - I
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
13
Algorithm for Solving the Permutation Problem
Algorithm for Solving the Permutation Problem
f1
f2
fk
x1x2
x3
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Permutation problem
One frequency binInstantaneous ICA algorithm
Permutation problem solved
1021 PM 14
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Direction Of Arrival (DOA) method
2
1
)sin(2 1
p
dcfjkqpkq
pkefWfU Position of the pth sensor
Velocity of sound
)(
)(
)()(
)()(
)(
)(
2
1
2221
1211
2
1
k
k
kk
kk
k
k
fX
fX
fWfW
fWfW
fY
fY
1021 PM
Direction of y1 = -30o
Direction of y2 = 20o
15
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Reasons for failure at lower freq
Lower spacing causes error in phase difference measurement
The relation is approximated for plane wave front under anechoic condition
Disadvantages
Fails at lower frequenciesFails when sources are nearRoom reverberationSensor positions must be known
Direction Of Arrival (DOA) method
1021 PM 16
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
f1
f2
fk
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Low correlation
High correlationLow
correlation
x1x2
x3
Adjacent bands correlation method
1021 PM 17
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21r11 r12
r21 r22
s1
s2
Correlation matrix
21122211 rrrr No change
21122211 rrrr Change permutation
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM
21122211 and and rrrr
21122211 and and rrrr
With confidence Without confidence
2090
8010
8020
1090
2090
1040
1040
2090
Example Example
18
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21
r11 r12
r21 r22
s1
s2
Correlation matrix
Disadvantage The method is not robust
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM 19
1021 PM
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Combination of DOA and Correlation methods method
DOA + Harmonic Correlation + Adjacent bands correlation
Advantage Increased robustness
20
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
2y
1y
1x
2x
1s
2s
1021 PM 21
Time domain stage
Frequency domain stage
Partial separation method(Parallel configuration)
Partial separation method(Parallel configuration)
1021 PM 22
Time domain stage
Frequency domain stage
Parallel configuration
Partial separation method(Cascade configuration)
Partial separation method(Cascade configuration)
1021 PM 23
Time domain stage
Frequency domain stage
Advantages of Partial Separation methodAdvantages of Partial Separation method
bull Robustness
1021 PM 24
Comparison with Adjacent Bands Correlation Method
Comparison with Adjacent Bands Correlation Method
1021 PM 25
PS - Partial Separation method with confidence check C1 - Correlation between the adjacent bins without confidence check C2 - Correlation between adjacent bins with confidence check Ha - Correlation between the harmonic components with confidence check PS1 - Partial separation method alone without confidence check
1021 PM 26
Comparison with DOA methodComparison with DOA method
My Contribution -IIMy Contribution -II
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
27
Underdetermined Blind Source Separation of Instantaneous Mixtures
Underdetermined Blind Source Separation of Instantaneous Mixtures
t
k
0)(0)( 222221 tkStkS
0)(0)( 222221 tkStkS
0)(0)( 112111 tkStkS
0)(0)( 112111 tkStkS
tkX 1
tkX 22x
1x
1021 PM 28
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Permutation Problem in Frequency Domain BSS
Permutation Problem in Frequency Domain BSS
f1
f2
fk
x1x2
x3
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Still signals are mixed
K po
int
IFFT
Corresponding to different sources Due to permutation problem
One frequency binInstantaneous ICA algorithm
Solv
ing
perm
utati
on
Prob
lem y1
y2y3
Separated signals
Corresponding to y3
1021 PM 11
MotivationMotivation
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
12
My Contribution - IMy Contribution - I
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
13
Algorithm for Solving the Permutation Problem
Algorithm for Solving the Permutation Problem
f1
f2
fk
x1x2
x3
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Permutation problem
One frequency binInstantaneous ICA algorithm
Permutation problem solved
1021 PM 14
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Direction Of Arrival (DOA) method
2
1
)sin(2 1
p
dcfjkqpkq
pkefWfU Position of the pth sensor
Velocity of sound
)(
)(
)()(
)()(
)(
)(
2
1
2221
1211
2
1
k
k
kk
kk
k
k
fX
fX
fWfW
fWfW
fY
fY
1021 PM
Direction of y1 = -30o
Direction of y2 = 20o
15
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Reasons for failure at lower freq
Lower spacing causes error in phase difference measurement
The relation is approximated for plane wave front under anechoic condition
Disadvantages
Fails at lower frequenciesFails when sources are nearRoom reverberationSensor positions must be known
Direction Of Arrival (DOA) method
1021 PM 16
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
f1
f2
fk
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Low correlation
High correlationLow
correlation
x1x2
x3
Adjacent bands correlation method
1021 PM 17
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21r11 r12
r21 r22
s1
s2
Correlation matrix
21122211 rrrr No change
21122211 rrrr Change permutation
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM
21122211 and and rrrr
21122211 and and rrrr
With confidence Without confidence
2090
8010
8020
1090
2090
1040
1040
2090
Example Example
18
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21
r11 r12
r21 r22
s1
s2
Correlation matrix
Disadvantage The method is not robust
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM 19
1021 PM
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Combination of DOA and Correlation methods method
DOA + Harmonic Correlation + Adjacent bands correlation
Advantage Increased robustness
20
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
2y
1y
1x
2x
1s
2s
1021 PM 21
Time domain stage
Frequency domain stage
Partial separation method(Parallel configuration)
Partial separation method(Parallel configuration)
1021 PM 22
Time domain stage
Frequency domain stage
Parallel configuration
Partial separation method(Cascade configuration)
Partial separation method(Cascade configuration)
1021 PM 23
Time domain stage
Frequency domain stage
Advantages of Partial Separation methodAdvantages of Partial Separation method
bull Robustness
1021 PM 24
Comparison with Adjacent Bands Correlation Method
Comparison with Adjacent Bands Correlation Method
1021 PM 25
PS - Partial Separation method with confidence check C1 - Correlation between the adjacent bins without confidence check C2 - Correlation between adjacent bins with confidence check Ha - Correlation between the harmonic components with confidence check PS1 - Partial separation method alone without confidence check
1021 PM 26
Comparison with DOA methodComparison with DOA method
My Contribution -IIMy Contribution -II
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
27
Underdetermined Blind Source Separation of Instantaneous Mixtures
Underdetermined Blind Source Separation of Instantaneous Mixtures
t
k
0)(0)( 222221 tkStkS
0)(0)( 222221 tkStkS
0)(0)( 112111 tkStkS
0)(0)( 112111 tkStkS
tkX 1
tkX 22x
1x
1021 PM 28
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
MotivationMotivation
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
12
My Contribution - IMy Contribution - I
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
13
Algorithm for Solving the Permutation Problem
Algorithm for Solving the Permutation Problem
f1
f2
fk
x1x2
x3
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Permutation problem
One frequency binInstantaneous ICA algorithm
Permutation problem solved
1021 PM 14
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Direction Of Arrival (DOA) method
2
1
)sin(2 1
p
dcfjkqpkq
pkefWfU Position of the pth sensor
Velocity of sound
)(
)(
)()(
)()(
)(
)(
2
1
2221
1211
2
1
k
k
kk
kk
k
k
fX
fX
fWfW
fWfW
fY
fY
1021 PM
Direction of y1 = -30o
Direction of y2 = 20o
15
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Reasons for failure at lower freq
Lower spacing causes error in phase difference measurement
The relation is approximated for plane wave front under anechoic condition
Disadvantages
Fails at lower frequenciesFails when sources are nearRoom reverberationSensor positions must be known
Direction Of Arrival (DOA) method
1021 PM 16
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
f1
f2
fk
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Low correlation
High correlationLow
correlation
x1x2
x3
Adjacent bands correlation method
1021 PM 17
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21r11 r12
r21 r22
s1
s2
Correlation matrix
21122211 rrrr No change
21122211 rrrr Change permutation
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM
21122211 and and rrrr
21122211 and and rrrr
With confidence Without confidence
2090
8010
8020
1090
2090
1040
1040
2090
Example Example
18
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21
r11 r12
r21 r22
s1
s2
Correlation matrix
Disadvantage The method is not robust
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM 19
1021 PM
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Combination of DOA and Correlation methods method
DOA + Harmonic Correlation + Adjacent bands correlation
Advantage Increased robustness
20
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
2y
1y
1x
2x
1s
2s
1021 PM 21
Time domain stage
Frequency domain stage
Partial separation method(Parallel configuration)
Partial separation method(Parallel configuration)
1021 PM 22
Time domain stage
Frequency domain stage
Parallel configuration
Partial separation method(Cascade configuration)
Partial separation method(Cascade configuration)
1021 PM 23
Time domain stage
Frequency domain stage
Advantages of Partial Separation methodAdvantages of Partial Separation method
bull Robustness
1021 PM 24
Comparison with Adjacent Bands Correlation Method
Comparison with Adjacent Bands Correlation Method
1021 PM 25
PS - Partial Separation method with confidence check C1 - Correlation between the adjacent bins without confidence check C2 - Correlation between adjacent bins with confidence check Ha - Correlation between the harmonic components with confidence check PS1 - Partial separation method alone without confidence check
1021 PM 26
Comparison with DOA methodComparison with DOA method
My Contribution -IIMy Contribution -II
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
27
Underdetermined Blind Source Separation of Instantaneous Mixtures
Underdetermined Blind Source Separation of Instantaneous Mixtures
t
k
0)(0)( 222221 tkStkS
0)(0)( 222221 tkStkS
0)(0)( 112111 tkStkS
0)(0)( 112111 tkStkS
tkX 1
tkX 22x
1x
1021 PM 28
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
My Contribution - IMy Contribution - I
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
13
Algorithm for Solving the Permutation Problem
Algorithm for Solving the Permutation Problem
f1
f2
fk
x1x2
x3
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Permutation problem
One frequency binInstantaneous ICA algorithm
Permutation problem solved
1021 PM 14
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Direction Of Arrival (DOA) method
2
1
)sin(2 1
p
dcfjkqpkq
pkefWfU Position of the pth sensor
Velocity of sound
)(
)(
)()(
)()(
)(
)(
2
1
2221
1211
2
1
k
k
kk
kk
k
k
fX
fX
fWfW
fWfW
fY
fY
1021 PM
Direction of y1 = -30o
Direction of y2 = 20o
15
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Reasons for failure at lower freq
Lower spacing causes error in phase difference measurement
The relation is approximated for plane wave front under anechoic condition
Disadvantages
Fails at lower frequenciesFails when sources are nearRoom reverberationSensor positions must be known
Direction Of Arrival (DOA) method
1021 PM 16
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
f1
f2
fk
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Low correlation
High correlationLow
correlation
x1x2
x3
Adjacent bands correlation method
1021 PM 17
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21r11 r12
r21 r22
s1
s2
Correlation matrix
21122211 rrrr No change
21122211 rrrr Change permutation
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM
21122211 and and rrrr
21122211 and and rrrr
With confidence Without confidence
2090
8010
8020
1090
2090
1040
1040
2090
Example Example
18
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21
r11 r12
r21 r22
s1
s2
Correlation matrix
Disadvantage The method is not robust
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM 19
1021 PM
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Combination of DOA and Correlation methods method
DOA + Harmonic Correlation + Adjacent bands correlation
Advantage Increased robustness
20
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
2y
1y
1x
2x
1s
2s
1021 PM 21
Time domain stage
Frequency domain stage
Partial separation method(Parallel configuration)
Partial separation method(Parallel configuration)
1021 PM 22
Time domain stage
Frequency domain stage
Parallel configuration
Partial separation method(Cascade configuration)
Partial separation method(Cascade configuration)
1021 PM 23
Time domain stage
Frequency domain stage
Advantages of Partial Separation methodAdvantages of Partial Separation method
bull Robustness
1021 PM 24
Comparison with Adjacent Bands Correlation Method
Comparison with Adjacent Bands Correlation Method
1021 PM 25
PS - Partial Separation method with confidence check C1 - Correlation between the adjacent bins without confidence check C2 - Correlation between adjacent bins with confidence check Ha - Correlation between the harmonic components with confidence check PS1 - Partial separation method alone without confidence check
1021 PM 26
Comparison with DOA methodComparison with DOA method
My Contribution -IIMy Contribution -II
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
27
Underdetermined Blind Source Separation of Instantaneous Mixtures
Underdetermined Blind Source Separation of Instantaneous Mixtures
t
k
0)(0)( 222221 tkStkS
0)(0)( 222221 tkStkS
0)(0)( 112111 tkStkS
0)(0)( 112111 tkStkS
tkX 1
tkX 22x
1x
1021 PM 28
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Algorithm for Solving the Permutation Problem
Algorithm for Solving the Permutation Problem
f1
f2
fk
x1x2
x3
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Permutation problem
One frequency binInstantaneous ICA algorithm
Permutation problem solved
1021 PM 14
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Direction Of Arrival (DOA) method
2
1
)sin(2 1
p
dcfjkqpkq
pkefWfU Position of the pth sensor
Velocity of sound
)(
)(
)()(
)()(
)(
)(
2
1
2221
1211
2
1
k
k
kk
kk
k
k
fX
fX
fWfW
fWfW
fY
fY
1021 PM
Direction of y1 = -30o
Direction of y2 = 20o
15
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Reasons for failure at lower freq
Lower spacing causes error in phase difference measurement
The relation is approximated for plane wave front under anechoic condition
Disadvantages
Fails at lower frequenciesFails when sources are nearRoom reverberationSensor positions must be known
Direction Of Arrival (DOA) method
1021 PM 16
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
f1
f2
fk
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Low correlation
High correlationLow
correlation
x1x2
x3
Adjacent bands correlation method
1021 PM 17
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21r11 r12
r21 r22
s1
s2
Correlation matrix
21122211 rrrr No change
21122211 rrrr Change permutation
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM
21122211 and and rrrr
21122211 and and rrrr
With confidence Without confidence
2090
8010
8020
1090
2090
1040
1040
2090
Example Example
18
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21
r11 r12
r21 r22
s1
s2
Correlation matrix
Disadvantage The method is not robust
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM 19
1021 PM
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Combination of DOA and Correlation methods method
DOA + Harmonic Correlation + Adjacent bands correlation
Advantage Increased robustness
20
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
2y
1y
1x
2x
1s
2s
1021 PM 21
Time domain stage
Frequency domain stage
Partial separation method(Parallel configuration)
Partial separation method(Parallel configuration)
1021 PM 22
Time domain stage
Frequency domain stage
Parallel configuration
Partial separation method(Cascade configuration)
Partial separation method(Cascade configuration)
1021 PM 23
Time domain stage
Frequency domain stage
Advantages of Partial Separation methodAdvantages of Partial Separation method
bull Robustness
1021 PM 24
Comparison with Adjacent Bands Correlation Method
Comparison with Adjacent Bands Correlation Method
1021 PM 25
PS - Partial Separation method with confidence check C1 - Correlation between the adjacent bins without confidence check C2 - Correlation between adjacent bins with confidence check Ha - Correlation between the harmonic components with confidence check PS1 - Partial separation method alone without confidence check
1021 PM 26
Comparison with DOA methodComparison with DOA method
My Contribution -IIMy Contribution -II
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
27
Underdetermined Blind Source Separation of Instantaneous Mixtures
Underdetermined Blind Source Separation of Instantaneous Mixtures
t
k
0)(0)( 222221 tkStkS
0)(0)( 222221 tkStkS
0)(0)( 112111 tkStkS
0)(0)( 112111 tkStkS
tkX 1
tkX 22x
1x
1021 PM 28
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Direction Of Arrival (DOA) method
2
1
)sin(2 1
p
dcfjkqpkq
pkefWfU Position of the pth sensor
Velocity of sound
)(
)(
)()(
)()(
)(
)(
2
1
2221
1211
2
1
k
k
kk
kk
k
k
fX
fX
fWfW
fWfW
fY
fY
1021 PM
Direction of y1 = -30o
Direction of y2 = 20o
15
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Reasons for failure at lower freq
Lower spacing causes error in phase difference measurement
The relation is approximated for plane wave front under anechoic condition
Disadvantages
Fails at lower frequenciesFails when sources are nearRoom reverberationSensor positions must be known
Direction Of Arrival (DOA) method
1021 PM 16
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
f1
f2
fk
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Low correlation
High correlationLow
correlation
x1x2
x3
Adjacent bands correlation method
1021 PM 17
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21r11 r12
r21 r22
s1
s2
Correlation matrix
21122211 rrrr No change
21122211 rrrr Change permutation
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM
21122211 and and rrrr
21122211 and and rrrr
With confidence Without confidence
2090
8010
8020
1090
2090
1040
1040
2090
Example Example
18
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21
r11 r12
r21 r22
s1
s2
Correlation matrix
Disadvantage The method is not robust
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM 19
1021 PM
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Combination of DOA and Correlation methods method
DOA + Harmonic Correlation + Adjacent bands correlation
Advantage Increased robustness
20
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
2y
1y
1x
2x
1s
2s
1021 PM 21
Time domain stage
Frequency domain stage
Partial separation method(Parallel configuration)
Partial separation method(Parallel configuration)
1021 PM 22
Time domain stage
Frequency domain stage
Parallel configuration
Partial separation method(Cascade configuration)
Partial separation method(Cascade configuration)
1021 PM 23
Time domain stage
Frequency domain stage
Advantages of Partial Separation methodAdvantages of Partial Separation method
bull Robustness
1021 PM 24
Comparison with Adjacent Bands Correlation Method
Comparison with Adjacent Bands Correlation Method
1021 PM 25
PS - Partial Separation method with confidence check C1 - Correlation between the adjacent bins without confidence check C2 - Correlation between adjacent bins with confidence check Ha - Correlation between the harmonic components with confidence check PS1 - Partial separation method alone without confidence check
1021 PM 26
Comparison with DOA methodComparison with DOA method
My Contribution -IIMy Contribution -II
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
27
Underdetermined Blind Source Separation of Instantaneous Mixtures
Underdetermined Blind Source Separation of Instantaneous Mixtures
t
k
0)(0)( 222221 tkStkS
0)(0)( 222221 tkStkS
0)(0)( 112111 tkStkS
0)(0)( 112111 tkStkS
tkX 1
tkX 22x
1x
1021 PM 28
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Reasons for failure at lower freq
Lower spacing causes error in phase difference measurement
The relation is approximated for plane wave front under anechoic condition
Disadvantages
Fails at lower frequenciesFails when sources are nearRoom reverberationSensor positions must be known
Direction Of Arrival (DOA) method
1021 PM 16
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
f1
f2
fk
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Low correlation
High correlationLow
correlation
x1x2
x3
Adjacent bands correlation method
1021 PM 17
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21r11 r12
r21 r22
s1
s2
Correlation matrix
21122211 rrrr No change
21122211 rrrr Change permutation
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM
21122211 and and rrrr
21122211 and and rrrr
With confidence Without confidence
2090
8010
8020
1090
2090
1040
1040
2090
Example Example
18
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21
r11 r12
r21 r22
s1
s2
Correlation matrix
Disadvantage The method is not robust
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM 19
1021 PM
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Combination of DOA and Correlation methods method
DOA + Harmonic Correlation + Adjacent bands correlation
Advantage Increased robustness
20
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
2y
1y
1x
2x
1s
2s
1021 PM 21
Time domain stage
Frequency domain stage
Partial separation method(Parallel configuration)
Partial separation method(Parallel configuration)
1021 PM 22
Time domain stage
Frequency domain stage
Parallel configuration
Partial separation method(Cascade configuration)
Partial separation method(Cascade configuration)
1021 PM 23
Time domain stage
Frequency domain stage
Advantages of Partial Separation methodAdvantages of Partial Separation method
bull Robustness
1021 PM 24
Comparison with Adjacent Bands Correlation Method
Comparison with Adjacent Bands Correlation Method
1021 PM 25
PS - Partial Separation method with confidence check C1 - Correlation between the adjacent bins without confidence check C2 - Correlation between adjacent bins with confidence check Ha - Correlation between the harmonic components with confidence check PS1 - Partial separation method alone without confidence check
1021 PM 26
Comparison with DOA methodComparison with DOA method
My Contribution -IIMy Contribution -II
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
27
Underdetermined Blind Source Separation of Instantaneous Mixtures
Underdetermined Blind Source Separation of Instantaneous Mixtures
t
k
0)(0)( 222221 tkStkS
0)(0)( 222221 tkStkS
0)(0)( 112111 tkStkS
0)(0)( 112111 tkStkS
tkX 1
tkX 22x
1x
1021 PM 28
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
f1
f2
fk
BSS
BSS
BSSMixed signals
K po
int
FFT
y1y2y3
Separated signals
K po
int
IFFT
Solv
ing
perm
utati
on
Prob
lem
Low correlation
High correlationLow
correlation
x1x2
x3
Adjacent bands correlation method
1021 PM 17
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21r11 r12
r21 r22
s1
s2
Correlation matrix
21122211 rrrr No change
21122211 rrrr Change permutation
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM
21122211 and and rrrr
21122211 and and rrrr
With confidence Without confidence
2090
8010
8020
1090
2090
1040
1040
2090
Example Example
18
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21
r11 r12
r21 r22
s1
s2
Correlation matrix
Disadvantage The method is not robust
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM 19
1021 PM
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Combination of DOA and Correlation methods method
DOA + Harmonic Correlation + Adjacent bands correlation
Advantage Increased robustness
20
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
2y
1y
1x
2x
1s
2s
1021 PM 21
Time domain stage
Frequency domain stage
Partial separation method(Parallel configuration)
Partial separation method(Parallel configuration)
1021 PM 22
Time domain stage
Frequency domain stage
Parallel configuration
Partial separation method(Cascade configuration)
Partial separation method(Cascade configuration)
1021 PM 23
Time domain stage
Frequency domain stage
Advantages of Partial Separation methodAdvantages of Partial Separation method
bull Robustness
1021 PM 24
Comparison with Adjacent Bands Correlation Method
Comparison with Adjacent Bands Correlation Method
1021 PM 25
PS - Partial Separation method with confidence check C1 - Correlation between the adjacent bins without confidence check C2 - Correlation between adjacent bins with confidence check Ha - Correlation between the harmonic components with confidence check PS1 - Partial separation method alone without confidence check
1021 PM 26
Comparison with DOA methodComparison with DOA method
My Contribution -IIMy Contribution -II
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
27
Underdetermined Blind Source Separation of Instantaneous Mixtures
Underdetermined Blind Source Separation of Instantaneous Mixtures
t
k
0)(0)( 222221 tkStkS
0)(0)( 222221 tkStkS
0)(0)( 112111 tkStkS
0)(0)( 112111 tkStkS
tkX 1
tkX 22x
1x
1021 PM 28
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21r11 r12
r21 r22
s1
s2
Correlation matrix
21122211 rrrr No change
21122211 rrrr Change permutation
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM
21122211 and and rrrr
21122211 and and rrrr
With confidence Without confidence
2090
8010
8020
1090
2090
1040
1040
2090
Example Example
18
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21
r11 r12
r21 r22
s1
s2
Correlation matrix
Disadvantage The method is not robust
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM 19
1021 PM
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Combination of DOA and Correlation methods method
DOA + Harmonic Correlation + Adjacent bands correlation
Advantage Increased robustness
20
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
2y
1y
1x
2x
1s
2s
1021 PM 21
Time domain stage
Frequency domain stage
Partial separation method(Parallel configuration)
Partial separation method(Parallel configuration)
1021 PM 22
Time domain stage
Frequency domain stage
Parallel configuration
Partial separation method(Cascade configuration)
Partial separation method(Cascade configuration)
1021 PM 23
Time domain stage
Frequency domain stage
Advantages of Partial Separation methodAdvantages of Partial Separation method
bull Robustness
1021 PM 24
Comparison with Adjacent Bands Correlation Method
Comparison with Adjacent Bands Correlation Method
1021 PM 25
PS - Partial Separation method with confidence check C1 - Correlation between the adjacent bins without confidence check C2 - Correlation between adjacent bins with confidence check Ha - Correlation between the harmonic components with confidence check PS1 - Partial separation method alone without confidence check
1021 PM 26
Comparison with DOA methodComparison with DOA method
My Contribution -IIMy Contribution -II
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
27
Underdetermined Blind Source Separation of Instantaneous Mixtures
Underdetermined Blind Source Separation of Instantaneous Mixtures
t
k
0)(0)( 222221 tkStkS
0)(0)( 222221 tkStkS
0)(0)( 112111 tkStkS
0)(0)( 112111 tkStkS
tkX 1
tkX 22x
1x
1021 PM 28
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
K-1
K
K+1 K+2
K+3
helliphelliphelliphellip
r12
r21
r11
r22 r22 r22 r22
r11 r11 r11
r12
r21
r12
r21
r12
r21
r11 r12
r21 r22
s1
s2
Correlation matrix
Disadvantage The method is not robust
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Adjacent bands correlation method
1021 PM 19
1021 PM
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Combination of DOA and Correlation methods method
DOA + Harmonic Correlation + Adjacent bands correlation
Advantage Increased robustness
20
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
2y
1y
1x
2x
1s
2s
1021 PM 21
Time domain stage
Frequency domain stage
Partial separation method(Parallel configuration)
Partial separation method(Parallel configuration)
1021 PM 22
Time domain stage
Frequency domain stage
Parallel configuration
Partial separation method(Cascade configuration)
Partial separation method(Cascade configuration)
1021 PM 23
Time domain stage
Frequency domain stage
Advantages of Partial Separation methodAdvantages of Partial Separation method
bull Robustness
1021 PM 24
Comparison with Adjacent Bands Correlation Method
Comparison with Adjacent Bands Correlation Method
1021 PM 25
PS - Partial Separation method with confidence check C1 - Correlation between the adjacent bins without confidence check C2 - Correlation between adjacent bins with confidence check Ha - Correlation between the harmonic components with confidence check PS1 - Partial separation method alone without confidence check
1021 PM 26
Comparison with DOA methodComparison with DOA method
My Contribution -IIMy Contribution -II
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
27
Underdetermined Blind Source Separation of Instantaneous Mixtures
Underdetermined Blind Source Separation of Instantaneous Mixtures
t
k
0)(0)( 222221 tkStkS
0)(0)( 222221 tkStkS
0)(0)( 112111 tkStkS
0)(0)( 112111 tkStkS
tkX 1
tkX 22x
1x
1021 PM 28
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
1021 PM
Existing Method forSolving the Permutation Problem
Existing Method forSolving the Permutation Problem
Combination of DOA and Correlation methods method
DOA + Harmonic Correlation + Adjacent bands correlation
Advantage Increased robustness
20
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
2y
1y
1x
2x
1s
2s
1021 PM 21
Time domain stage
Frequency domain stage
Partial separation method(Parallel configuration)
Partial separation method(Parallel configuration)
1021 PM 22
Time domain stage
Frequency domain stage
Parallel configuration
Partial separation method(Cascade configuration)
Partial separation method(Cascade configuration)
1021 PM 23
Time domain stage
Frequency domain stage
Advantages of Partial Separation methodAdvantages of Partial Separation method
bull Robustness
1021 PM 24
Comparison with Adjacent Bands Correlation Method
Comparison with Adjacent Bands Correlation Method
1021 PM 25
PS - Partial Separation method with confidence check C1 - Correlation between the adjacent bins without confidence check C2 - Correlation between adjacent bins with confidence check Ha - Correlation between the harmonic components with confidence check PS1 - Partial separation method alone without confidence check
1021 PM 26
Comparison with DOA methodComparison with DOA method
My Contribution -IIMy Contribution -II
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
27
Underdetermined Blind Source Separation of Instantaneous Mixtures
Underdetermined Blind Source Separation of Instantaneous Mixtures
t
k
0)(0)( 222221 tkStkS
0)(0)( 222221 tkStkS
0)(0)( 112111 tkStkS
0)(0)( 112111 tkStkS
tkX 1
tkX 22x
1x
1021 PM 28
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
Proposed algorithm Partial separation method(Parallel configuration)
Reference V G Reju S N Koh and I Y Soon ldquoPartial separation method for solving permutation problem in frequency domain blind source separation of speech signalsrdquo Neurocomputing Vol 71 NO 10ndash12 June 2008 pp 2098ndash2112
2y
1y
1x
2x
1s
2s
1021 PM 21
Time domain stage
Frequency domain stage
Partial separation method(Parallel configuration)
Partial separation method(Parallel configuration)
1021 PM 22
Time domain stage
Frequency domain stage
Parallel configuration
Partial separation method(Cascade configuration)
Partial separation method(Cascade configuration)
1021 PM 23
Time domain stage
Frequency domain stage
Advantages of Partial Separation methodAdvantages of Partial Separation method
bull Robustness
1021 PM 24
Comparison with Adjacent Bands Correlation Method
Comparison with Adjacent Bands Correlation Method
1021 PM 25
PS - Partial Separation method with confidence check C1 - Correlation between the adjacent bins without confidence check C2 - Correlation between adjacent bins with confidence check Ha - Correlation between the harmonic components with confidence check PS1 - Partial separation method alone without confidence check
1021 PM 26
Comparison with DOA methodComparison with DOA method
My Contribution -IIMy Contribution -II
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
27
Underdetermined Blind Source Separation of Instantaneous Mixtures
Underdetermined Blind Source Separation of Instantaneous Mixtures
t
k
0)(0)( 222221 tkStkS
0)(0)( 222221 tkStkS
0)(0)( 112111 tkStkS
0)(0)( 112111 tkStkS
tkX 1
tkX 22x
1x
1021 PM 28
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Partial separation method(Parallel configuration)
Partial separation method(Parallel configuration)
1021 PM 22
Time domain stage
Frequency domain stage
Parallel configuration
Partial separation method(Cascade configuration)
Partial separation method(Cascade configuration)
1021 PM 23
Time domain stage
Frequency domain stage
Advantages of Partial Separation methodAdvantages of Partial Separation method
bull Robustness
1021 PM 24
Comparison with Adjacent Bands Correlation Method
Comparison with Adjacent Bands Correlation Method
1021 PM 25
PS - Partial Separation method with confidence check C1 - Correlation between the adjacent bins without confidence check C2 - Correlation between adjacent bins with confidence check Ha - Correlation between the harmonic components with confidence check PS1 - Partial separation method alone without confidence check
1021 PM 26
Comparison with DOA methodComparison with DOA method
My Contribution -IIMy Contribution -II
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
27
Underdetermined Blind Source Separation of Instantaneous Mixtures
Underdetermined Blind Source Separation of Instantaneous Mixtures
t
k
0)(0)( 222221 tkStkS
0)(0)( 222221 tkStkS
0)(0)( 112111 tkStkS
0)(0)( 112111 tkStkS
tkX 1
tkX 22x
1x
1021 PM 28
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Parallel configuration
Partial separation method(Cascade configuration)
Partial separation method(Cascade configuration)
1021 PM 23
Time domain stage
Frequency domain stage
Advantages of Partial Separation methodAdvantages of Partial Separation method
bull Robustness
1021 PM 24
Comparison with Adjacent Bands Correlation Method
Comparison with Adjacent Bands Correlation Method
1021 PM 25
PS - Partial Separation method with confidence check C1 - Correlation between the adjacent bins without confidence check C2 - Correlation between adjacent bins with confidence check Ha - Correlation between the harmonic components with confidence check PS1 - Partial separation method alone without confidence check
1021 PM 26
Comparison with DOA methodComparison with DOA method
My Contribution -IIMy Contribution -II
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
27
Underdetermined Blind Source Separation of Instantaneous Mixtures
Underdetermined Blind Source Separation of Instantaneous Mixtures
t
k
0)(0)( 222221 tkStkS
0)(0)( 222221 tkStkS
0)(0)( 112111 tkStkS
0)(0)( 112111 tkStkS
tkX 1
tkX 22x
1x
1021 PM 28
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Advantages of Partial Separation methodAdvantages of Partial Separation method
bull Robustness
1021 PM 24
Comparison with Adjacent Bands Correlation Method
Comparison with Adjacent Bands Correlation Method
1021 PM 25
PS - Partial Separation method with confidence check C1 - Correlation between the adjacent bins without confidence check C2 - Correlation between adjacent bins with confidence check Ha - Correlation between the harmonic components with confidence check PS1 - Partial separation method alone without confidence check
1021 PM 26
Comparison with DOA methodComparison with DOA method
My Contribution -IIMy Contribution -II
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
27
Underdetermined Blind Source Separation of Instantaneous Mixtures
Underdetermined Blind Source Separation of Instantaneous Mixtures
t
k
0)(0)( 222221 tkStkS
0)(0)( 222221 tkStkS
0)(0)( 112111 tkStkS
0)(0)( 112111 tkStkS
tkX 1
tkX 22x
1x
1021 PM 28
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Comparison with Adjacent Bands Correlation Method
Comparison with Adjacent Bands Correlation Method
1021 PM 25
PS - Partial Separation method with confidence check C1 - Correlation between the adjacent bins without confidence check C2 - Correlation between adjacent bins with confidence check Ha - Correlation between the harmonic components with confidence check PS1 - Partial separation method alone without confidence check
1021 PM 26
Comparison with DOA methodComparison with DOA method
My Contribution -IIMy Contribution -II
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
27
Underdetermined Blind Source Separation of Instantaneous Mixtures
Underdetermined Blind Source Separation of Instantaneous Mixtures
t
k
0)(0)( 222221 tkStkS
0)(0)( 222221 tkStkS
0)(0)( 112111 tkStkS
0)(0)( 112111 tkStkS
tkX 1
tkX 22x
1x
1021 PM 28
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
PS - Partial Separation method with confidence check C1 - Correlation between the adjacent bins without confidence check C2 - Correlation between adjacent bins with confidence check Ha - Correlation between the harmonic components with confidence check PS1 - Partial separation method alone without confidence check
1021 PM 26
Comparison with DOA methodComparison with DOA method
My Contribution -IIMy Contribution -II
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
27
Underdetermined Blind Source Separation of Instantaneous Mixtures
Underdetermined Blind Source Separation of Instantaneous Mixtures
t
k
0)(0)( 222221 tkStkS
0)(0)( 222221 tkStkS
0)(0)( 112111 tkStkS
0)(0)( 112111 tkStkS
tkX 1
tkX 22x
1x
1021 PM 28
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
My Contribution -IIMy Contribution -II
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
27
Underdetermined Blind Source Separation of Instantaneous Mixtures
Underdetermined Blind Source Separation of Instantaneous Mixtures
t
k
0)(0)( 222221 tkStkS
0)(0)( 222221 tkStkS
0)(0)( 112111 tkStkS
0)(0)( 112111 tkStkS
tkX 1
tkX 22x
1x
1021 PM 28
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Underdetermined Blind Source Separation of Instantaneous Mixtures
Underdetermined Blind Source Separation of Instantaneous Mixtures
t
k
0)(0)( 222221 tkStkS
0)(0)( 222221 tkStkS
0)(0)( 112111 tkStkS
0)(0)( 112111 tkStkS
tkX 1
tkX 22x
1x
1021 PM 28
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
Mathematical Representation of Instantaneous Mixing
Reference V G Reju S N Koh and I Y Soon ldquoAn algorithm for mixing matrix estimation in instantaneous blind source separationrdquo Signal Processing Vol 89 Issue 9 September 2009 pp 1762ndash1773
)(
)(
)(
)( 1
1
1111
tkS
tkS
hh
hh
tkX
tkX
QPQP
Q
P
)(
)(
)(
)( 1
1
1111
ts
ts
hh
hh
tx
tx
QPQP
Q
P
)()()(11
1
1
11
tkS
h
h
tkS
h
h
tkS
h
h
Q
PQ
Q
q
Pq
q
P
)()()()( 21 tkStkStkStk QhhhX
Time domain
Time-Frequency domain
1021 PM 29
P ndash No of mixturesQ ndash No of sources
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
00point at 1 Case 11211111 tkStkStk
)()()( 1122111111 tkStkStk hhX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 2222221122 tkStkStk hhX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
1021 PM
0 0
30
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Q
qqq tkStk
1
hX
2Let Q
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM 31
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
00point at 1 Case 11211111 tkStkStk
)()( 111111 tkStk hX
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()( 222222 tkStk hX
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2
Scalar
ScalarScalar
Scalar
At single source point 1 At single source point 2
111
111
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR 222
222
ofDirection )( ofDirection
ofDirection )( ofDirection
hX
hX
tkI
tkR
Single Source Points in Time-Frequency domainSingle Source Points in
Time-Frequency domain
1021 PM
1111 ( of Direction ( of Direction tkItkR XX 2222 ( of Direction ( of Direction tkItkR XX 32
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Perfectly Sparse
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 0
Example
1021 PM 33
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
)5()4()3()2()1(
22222
11111
2221
1211
22222
11111
kSkSkSkSkS
kSkSkSkSkS
hh
hh
kXkXkXkXkX
kXkXkXkXkX
21
111 h
hh
22
122 h
hh
0 00 0 00
Example
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
Scatter Diagram of the Mixtures When Source are Not Perfectly Sparse
1021 PM 34
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Scatter Diagram of the Mixtures when Sources are Sparse
Scatter Diagram of the Mixtures when Sources are Sparse
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
35
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
Scatter Diagram of the Mixtures when Sources are Sparse After Clustering
3h
6h
5h
1h4h
2h
1021 PM
No of sources = 6No of mixtures = 2
36
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
Scatter Diagram of the Mixtures when Sources are Not Perfectly Sparse
3h
6h
5h
1h4h
2h
1021 PM
ObjectiveEstimation of the single source points
No of sources = 6No of mixtures = 2
37
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Multi source point
38
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
00point At 1 Case 11211111 tkStkStk
)()(
)()(
111111
111111
tkSItkI
tkSRtkR
hX
hX
Single source point 1
00point At 2 Case 22222122 tkStkStk
)()(
)()(
222222
222222
tkSItkI
tkSRtkR
hX
hX
Single source point 2Scalar
ScalarScalar
Scalar
00point At 3 Case 33233133 tkStkStk
)()()(
)()()(
3322331133
3322331133
tkSItkSItkI
tkSRtkSRtkR
hhX
hhX
)()()( 3322331133 tkStkStk hhX
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
Multi source point
39
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
)(
)(
)(
)( ifonly and if
)( ofDirection )( ofDirection
332
332
331
331
3333
tkSI
tkSR
tkSI
tkSR
tkXItkXR
Average of 15 pairs of speech utterances of length 10 s each
1021 PM
Principle of the Proposed Algorithm for the Detection of Single Source Points
Principle of the Proposed Algorithm for the Detection of Single Source Points
)( ofDirection )( ofDirection tkXItkXR )( ofDirection )( ofDirection tkXItkXR
SSPMSP
40
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
SSPt
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)()(
)()(
112112
111111
tkXjItkXR
tkXjItkXR
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXI
)(
)(
112
111
tkXR
tkXR
)(
)(
112
111
tkXI
tkXIMSP
)( 111 tkX
)( 112 tkX
cos
)()(
)()(
tkXItkXR
tkXItkXR T
Proposed Algorithm for the Detection of Single Source Points
Proposed Algorithm for the Detection of Single Source Points
1021 PM 41
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Elimination of OutliersElimination of Outliers
SSPs detection
Clus
terin
g
Outlier elimination
1021 PM 42
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
1021 PM
Experimental ResultsExperimental Results
dB6747NMSE
No of mixtures =2 No of sources =6
43
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Detected Single Source PointsThree mixtures
Detected Single Source PointsThree mixtures
No of mixtures =3 No of sources =61021 PM 44
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Comparison with Classical Algorithms for Determined Case
Comparison with Classical Algorithms for Determined Case
No of mixtures =2No of sources =2
Average of 500 experimental results
1021 PM 45
-gt
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Comparison with Method Proposed in [1] Underdetermined case
Comparison with Method Proposed in [1] Underdetermined case
[1] Y Li S Amari A Cichocki D W C Ho and S Xie ldquoUnderdetermined blind source separation based on sparse representationrdquo IEEE Transactions on Signal Processing vol 54 p 423ndash437 Feb 2006
1021 PM
Nor
mal
ized
mea
n sq
uare
err
or (N
MSE
) in
mix
ing
mat
rix e
stim
ation
(dB)
Order of the mixing matrices (PxQ)
46
P ndash No of mixturesQ ndash No of sources
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Advantages of the Proposed algorithmAdvantages of the Proposed algorithm
Step 1 Convert x in the time domain to the TF domain to get XStep 2 Check the condition
Step 3 If the condition is satisfied then X(k t) is a sample atthe SSP and this sample is kept for mixing matrix estimationotherwise discard the point
Step 4 Repeat Steps 2 to 3 for all the points in the TF planeor until sufficient number of SSPs are obtained
cos
)()(
)()(
tkXItkXR
tkXItkXR T
1) Much simpler constrain the algorithm does not require ldquosingle source zonerdquo
3) The algorithm is extremely simple but effective2) Separation performance is better
1021 PM 47-gt
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
My Contributions ndash III IV and VMy Contributions ndash III IV and V
1021 PM1021 PM
mixtures ge sources mixtures ge sources
mixtures lt sources mixtures lt sources
BSS
Determined Overdetermined
Underdetermined
Instantaneous
Instantaneous
Convolutive
Convolutive
Frequency domain
Frequency domain
Time domain
Time domain
Mixing matrix estimation
Frequency bin-wise separation
Frequency bin-wise separation
Permutation problem
Permutation problem
Source estimation
Automatic detection of no of sources
48
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Underdetermined Convolutive Blind Source Separation via Time-Frequency Masking
Reference V G Reju S N Koh and I Y Soon ldquoUnderdetermined Convolutive Blind Source Separation via Time- Frequency Maskingrdquo IEEE Transactions on Audio Speech and Language Processing Vol 18 NO 1 Jan 2010 pp 101ndash116
Mask estimation
Mic 1
Mic P
Mixture in TF domain
Separated signals in TF domain
)( tkX P
)(1 tkX
t
t
k
k
)(1 tkY
)(1 tkY
)( tkYQ
)( tkYQ
QP
1021 PM 49
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Mathematical RepresentationMathematical Representation
Q
q
L
lqpqp Pplnslhnx
1
1
0
1)()()(
)()()()()()()( 11 tkSktkSktkSktk QQqq HHHX
)(
)(
)()(
)()(
)(
)( 1
1
1111
tkS
tkS
kHkH
kHkH
tkX
tkX
QPQP
Q
P
)(
)(
)(
)(
)(
)(
)(
)(
)( 11
1
1
11
tkS
kH
kH
tkS
kH
kH
tkS
kH
kH
Q
PQ
Q
q
Pq
q
P
Time domain
Frequency domain
1021 PM 50
P ndash No of mixturesQ ndash No of sources
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Single source pointsSingle source pointsInstantaneous mixing
00point at 1 Case 11211111 tkStkStk
)()()(
)()()(
111111
111111
tkSIktkI
tkSRktkR
HX
HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()(
)()()(
222222
222222
tkSIktkI
tkSRktkR
HX
HX
Single source point 2
Real scalar
RealReal
Real scalarReal scalar
Real scalar
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
1021 PM 51
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Basic Principle of Single Source Points Detection
Basic Principle of Single Source Points Detection
Convolutive mixing
00point at 1 Case 11211111 tkStkStk
)()()( 111111 tkSktk HX
Single source point 1
00point at 2 Case 22222122 tkStkStk
)()()( 222222 tkSktk HX
Single source point 2
Complex scalar
Complex
Complex
Complex scalar
j
H
C
e
cos21
21
UU
UUUUU H
and 20coscos HCH
angle pseudo called is
angleHermitian called is
H The Hermitian angle between the complex
vectors u1 and u2 will remain the same even if the vectors are multiplied by any complex scalars whereas the pseudo angle will change
1021 PM 52
-gt
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
k
Algorithm for Single Source Points Detection
Algorithm for Single Source Points Detection
t
k
tkX 1
tkX 22x
1x
)( 111 tkX
)( 112 tkX
)(
)(
)(111
121
111 tkSkH
kH
11
11
j
jr
)(
)(
112
111
tkX
tkX
11
11
j
jr
)(
)(
112
111
tkX
tkX
)( 111 tkX
)( 112 tkX
)(
)(
)(112
122
112 tkSkH
kH θH2
θH1
θH2
rX
rX
)(
)(cos
11
111
tk
tk H
Hq
1021 PM 53
θH1
OR
1k
1k
1t
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Clean
Estimated
Mask Estimation by k-means (KM)Mask Estimation by k-means (KM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 54
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Clean
Estimated
Mask Estimation by Fuzzy c-means (FCM)Mask Estimation by Fuzzy c-means (FCM)
QqttkXtkMtkY pqq 1)()()(
1021 PM 55
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Automatic Detection of Number of SourcesAutomatic Detection of Number of Sources
1021 PM 56
Cluster validation technique
For c = 2 to cmax
Cluster the data into c clustersCalculate the cluster validation index
EndTake c corresponding to the best cluster as the number of sources
-gt
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57
Elimination of Low Energy PointsElimination of Low Energy Points
1021 PM 57