Music Information Retrieval Information Universe Seongmin Lim hovern@snu.ac.kr Dept. of Industrial...

Music Information RetrievalInformation Universe

Seongmin Lim

hovern@snu.ac.kr

Dept. of Industrial Engineering

Seoul National University

contents

Brief history of MIR and state of research

Cross media retrieval supporting Natural language queries like mood, melody information.- Contain semantic information taken from community data bases- “A Music Search Engine Built upon Audio-based and Web-based

Similarity Measures”

Query by Example- You have an example query having the same representation in

the database.- For music search: humming, recorded by cell phones,

microphones- “Music Structure Based Vector Space Retrieval”

Stages of First Paper

“A Music Search Engine Built upon Audio-based and Web-based Similarity Measures”

Stage 1: Preprocessing the Collection

Using information in the ID3 tag- Artist- Album- Title

all duplicates of tracks are excluded to avoid redundancies

Live or instrumentals of the same song removed

Stage 2: Web based features addition

Search on the web for- “artist”music- “artist”“album”music review- “artist”“title”music review –lyrics

Stage 2: Web based features addition (2)

Every term is weighted according to the term frequency ×inverse document frequency (tf×idf) function. w(t,m) of a term t for music piece m. N is the total number of documents.

Stage 3: Audio Based Similarity measures

For each audio track, Mel Frequency Cepstral Coefficients (MFCCs) are computed on short-time audio segments (called frames)

each song is represented as a Gaussian Mixture Model (GMM) of the distribution of MFCCs

Kullback-Leibler divergence can be calculated on the means and covariance matrices

A rank list of similar tracks is found based on this measure corresponding to each track

GMM(Gaussian Mixture Model)

a probabilistic model for representing the presence of sub-populations within an overall population

the mixture distribution that represents the probability distribution of observations in the overall population

Stage 4: Dimensionality Reduction

chi square test to distinguish the most similar terms using audio similarities

A is the number of documents in s which contain t B is the number of documents in d which contain t C is the number of documents in s without t D is the number of documents in d without t N is the total number of examined documents

Stage 5: Vector Adaptation

Smoothing for tracks where no related information

Querying the Music Search Engine

method to find those tracks that are most similar to a natural language query

extend queries to the music search engine by the word music and send them to Google

Query vector is constructed in the feature space from the top 10 pages retrieved

Euclidean distances are calculated from the collection tracks and a relevance ranking is got

Evaluating the System

to evaluate on “real-world” queries, a source for phrases which are used by people to describe music is needed

Tags provided by AudioScrobbler groundtruth is used

227 tags are used

as test queries

Goal of the evaluation

Goals- Effect of dimensionality on the feature space- Retrieving relevant information - Effect of re weighting of the term vectors- Effect of query expansion

Metrics used : precision values for various recall levels

Performance Evaluation -I

audio-based term selection has a very positive impact on the retrieval

setting 2/50 yields best results

Performance Evaluation -II

Effect of re weighting using various re weighting techniques

the impact of audiobased vector re-weighting is only marginal

Performance Evaluation –III (other metrics)

Examples

System design of Second paper

“Music structure based vector space retrieval”

Music Layout : The Pyramid

Stage 1: MUSIC INFORMATION MODELING

Music Segmentation by smallest note length

Cord modeling

Music region content modeling

Stage 2: MUSIC INDEXING AND RETRIEVAL

Harmony Event and Acoustic Event- each song’s cord and music region information is represented as

a Gaussian Mixture Model (GMM) of the distribution of MFCCs

n-gram Vector- The harmony and acoustic decoders serve as the tokenizers for

music signal- an event is represented in a text-like format

Stage 3: Music information retrieval

Summary

Natural query vs. query by example Information from web and audio Audio frame segmentation KL divergence vs. vector space modeling Analyzing audio features Data itself vs. metadata domain knowledge of music

End of Document

Seongmin Lim

hovern@snu.ac.kr

Dept. of Industrial Engineering

Seoul National University

Music Information Retrieval Information Universe Seongmin Lim hovern@snu.ac.kr Dept. of Industrial...

Documents

Seoul National University - Eric Y. Hsiaoastro1.snu.ac.kr/eama10/20160929_EAMA10_Mugunghwa/Session... · 2016-10-07 · Eric Y. Hsiao hsiao@physics.fsu.edu EAMA Seoul, September,

조직 구성원의 심리적 안녕: 리뷰와 메타분석 연구hosting03.snu.ac.kr/~wwpark/sub/jrct/j00078.pdf · 심리적 안녕 ․주관적 안녕 ․종합적 안녕 ․직장

서울대학교교과과정해설1 - ie2.snu.ac.krie2.snu.ac.kr/IE_curriculum_guide1.pdf · 1본문 서는2008 년부터 2018 까지 발행된「 울대학교교 과과정」에 ‘교

I-1 Internet Intro Taekyoung Kwon tkkwon@snu.ac.kr

CUDA. Assignment Subject: DES using CUDA Deliverables: des.c, des.cu, report Due: 12/14, nai0315@snu.ac.kr

s 서울대리플kaku3.snu.ac.kr/upload/about/guidemap_eng.pdf · Seven subsidiary libraries specializing in social sciences, business administration, international studies, agricultural,

글로벌공학교육센터컨벤션 - Seoul National Universitysnu38.snu.ac.kr/convxe/html/brochure.pdf · 2019-07-19 · 대관 문의 Tel.02-880-1544,1599 E-mail. gcp38@snu.ac.kr

Structural Characterization of Febuxostat/l-Pyroglutamic Acid Cocrystal Using Solid ...hosting03.snu.ac.kr/~suhlab/2008/pub/158a.pdf · 2018-12-11 · crystals Article Structural

snu.ac.kr · Created Date: 7/20/2010 11:29:13 AM

OpenSGX: An Open Platform for SGX Researchopensgx.pdf · OpenSGX: An Open Platform for SGX Research Prerit Jain †Soham Desai Seongmin Kim⋆ Ming-Wei Shih† JaeHyuk Lee⋆ Changho

Information Structure in PA/SN or Descriptive/ …hosting03.snu.ac.kr/~clee/papers/Lee_information structure in PA SN... · Information Structure in PA/SN or Descriptive/ Metalinguistic

From Global Value Chains (GVC) to Innovation Systems for ...econbk21.snu.ac.kr/sites/econbk21.snu.ac.kr/files/board/양식게시판... · The two pillar concepts in GVC are governance

New Seoul National Universityhosting03.snu.ac.kr/~korean/old/data/morphology/... · 2016. 5. 24. · Morphological Typology of Deponency* MATTHEW BAERMAN 1. Introduction 'DEPONENCY

Comparative Proton Transfer Eﬃciencies of Hydronium …hosting03.snu.ac.kr/~surfion/Himage/pdf/133.pdf · Comparative Proton Transfer Eﬃciencies of ... and static hypercoordination

Highly Branched Polycaprolactone/Glycidol Copolymeric Green …hosting03.snu.ac.kr/~eco/file/140.pdf · 2020-01-20 · Highly Branched Polycaprolactone/Glycidol Copolymeric Green

4장확률분포 Probability distributionshosting03.snu.ac.kr/~hokim/int/2018/chap_4.pdf · 2018. 3. 19. · 4.1 이산확률분포(probability density function of a discrete random

Lynn Ilon Seoul National University lynnilon@snu.ac.kr

Experimental Faulting of Serpentinite during Dehydration: Implications for Earthquakes ...hosting03.snu.ac.kr/~hjung/pdf/Jung_and_Green-IGR-2004.pdf · 2008-10-28 · Experimental

Digital Electronics 2 - Transistors Introduction to CAD Naehyuck Chang naehyuck@snu.ac.kr

I-3 content-centric networking Taekyoung Kwon (TK) tkkwon@snu.ac.kr Some slides are from Van Jacobson@PARC 1