13
Non-negative Matrix Factor Deconvolution; Extracation of Multiple Sound Sources from Monophonic Inputs International Symposium on Independent Component Analysis and Blind Source Separation, ICA 2004 Paris Smaragdis / Mitsubishi Electric Research Laboratori Presenter: Jain_De ,Lee

International Symposium on I ndependent Component Analysis and Blind Source Separation, ICA 2004

  • Upload
    avery

  • View
    51

  • Download
    1

Embed Size (px)

DESCRIPTION

Non-negative Matrix Factor Deconvolution ; Extracation of Multiple Sound Sources from Monophonic Inputs. International Symposium on I ndependent Component Analysis and Blind Source Separation, ICA 2004. Paris Smaragdis / Mitsubishi Electric Research Laboratories. - PowerPoint PPT Presentation

Citation preview

Page 1: International Symposium on I ndependent Component  Analysis and Blind Source  Separation, ICA 2004

Non-negative Matrix Factor Deconvolution;Extracation of Multiple Sound Sources from Monophonic Inputs

International Symposium on Independent Component Analysis and Blind Source Separation, ICA 2004

Paris Smaragdis / Mitsubishi Electric Research Laboratories

Presenter: Jain_De ,Lee

Page 2: International Symposium on I ndependent Component  Analysis and Blind Source  Separation, ICA 2004

Outline

• Introduction

• Non-negative Matrix Factorization

• Non-negative Matrix Factor Deconvolution

• Conclusions

Page 3: International Symposium on I ndependent Component  Analysis and Blind Source  Separation, ICA 2004

Introduction

• Theory of The Origin

• An extension to the Non-Negative Matrix Factorization algorithm – Identifying components with temporal

structure

Paatero (1997)

Lee&Seung (1999)

Page 4: International Symposium on I ndependent Component  Analysis and Blind Source  Separation, ICA 2004

Non-negative Matrix Factorization

• The Original Formulation of NMF

NRRMNM HWV

[W] : Basis Functions Matrix

[H] :Time Weights Matrix

Page 5: International Symposium on I ndependent Component  Analysis and Blind Source  Separation, ICA 2004

Non-negative Matrix Factorization

• The Cost Function

• Multiplicative Update Algorithm

1

T

T

WHWV

WHH T

T

H

HHWV

WW

1

F

WHVWH

VVD )ln(

Page 6: International Symposium on I ndependent Component  Analysis and Blind Source  Separation, ICA 2004

Non-negative Matrix Factorization

• NMF for Sound Object Extraction

STFT

Page 7: International Symposium on I ndependent Component  Analysis and Blind Source  Separation, ICA 2004

Non-negative Matrix Factorization

Page 8: International Symposium on I ndependent Component  Analysis and Blind Source  Separation, ICA 2004

Non-negative Matrix Factor Deconvolution

• The Formulation of NMFD

• The Operator Shifts The Columns

WHV

1

0

T

t

t

t HWV

987

654

321

A

870

540

2101

A

700

400

1002

A ….

i

)(

Page 9: International Symposium on I ndependent Component  Analysis and Blind Source  Separation, ICA 2004

Non-negative Matrix Factor Deconvolution

• The Cost Function

• The Update Rules

F

VV

VD

ln

1

Tt

t

Tt

W

VW

HH Tt

Tt

tt

H

HV

WW

1

10 Tt

1

0

T

t

t

t HW, where

Page 10: International Symposium on I ndependent Component  Analysis and Blind Source  Separation, ICA 2004

Non-negative Matrix Factor Deconvolution

Page 11: International Symposium on I ndependent Component  Analysis and Blind Source  Separation, ICA 2004

Non-negative Matrix Factor Deconvolution

• In this example the drum sounds exhibit some overlap at both time and frequency

Three types of drum sounds present into the scece Sampled at 11.025 kHz 256-point DFTs which were overlapping by 128-points Performed for 3 basis functions

Page 12: International Symposium on I ndependent Component  Analysis and Blind Source  Separation, ICA 2004

Non-negative Matrix Factor Deconvolution

• Reconstruction

tT

t

jtj HWV

1

0

)(

Page 13: International Symposium on I ndependent Component  Analysis and Blind Source  Separation, ICA 2004

Conclusions

• Pinpointed some of the shortcomings of conventional NMF when analyzing temporal patterns and presented an extension

• Spectral bases have been used on spectrograms to extract sound objects from single channel sound scenes