14
Polyphonic Transcription Bruno Angeles McGill University - Schulich School of Music MUMT-621 Fall 2009 1/14

Polyphonic Transcription Bruno Angeles McGill University - Schulich School of Music MUMT-621 Fall 2009 1/14

Embed Size (px)

Citation preview

Page 1: Polyphonic Transcription Bruno Angeles McGill University - Schulich School of Music MUMT-621 Fall 2009 1/14

Polyphonic Transcription

Bruno Angeles

McGill University - Schulich School of Music

MUMT-621

Fall 2009

1/14

Page 2: Polyphonic Transcription Bruno Angeles McGill University - Schulich School of Music MUMT-621 Fall 2009 1/14

Outline

Polyphonic vs Monophonic The Human Method Issues Methods Evaluation of Transcription Methods

2/14

Page 3: Polyphonic Transcription Bruno Angeles McGill University - Schulich School of Music MUMT-621 Fall 2009 1/14

Polyphonic vs Monotonic Transcription

Audio signal with musical content

symbolic format Monophonic: a single instrument playing individual notes Polyphonic transcription:

A single instrument playing several concurrent notes An instrument playing individual notes with other

instruments An instrument playing several concurrent notes with other

instruments Several instruments playing concurrent notes

3/14

Page 4: Polyphonic Transcription Bruno Angeles McGill University - Schulich School of Music MUMT-621 Fall 2009 1/14

The Human Method

1. Initial sketch Sections and key phrases

2. Chord scheme or bass line

3. Melody and counter–melodies

Instrument playback Musical knowledge Beat tracking Style detection Instrument identification

Hainsworth and Macleod (2004)4/14

Page 5: Polyphonic Transcription Bruno Angeles McGill University - Schulich School of Music MUMT-621 Fall 2009 1/14

Issues with Polyphonic Transcription

Restrictions:instrument, genre, maximum polyphony

Transcription:pitch + timing + attack + release

Overlapping harmonics difficult frequency–based analysis

Solution: better understanding and reproduction of the human method?

5/14

Page 6: Polyphonic Transcription Bruno Angeles McGill University - Schulich School of Music MUMT-621 Fall 2009 1/14

A Few Definitions

Musical note: “a discrete note pitch with a specific onset and an offset time”

Melodies: “consecutive note sequences with organized and recognizable shape”

Chords: “combinations of simultaneously sounding notes”

Ryynänen and Klapuri (2005)

6/14

Page 7: Polyphonic Transcription Bruno Angeles McGill University - Schulich School of Music MUMT-621 Fall 2009 1/14

Methods Bottom–up methods:

no high–level analysis

Blackboard systems:develop hypotheses at various levels

Model–based algorithms:high– and low–level analysis + parameter extraction

Common method:

1. Preprocessing (e.g., lowpass filtering)

2. Event extraction

3. Classification: Support Vector Machines, Hidden Markov Models, Neural Networks, Gaussian Mixture Models, etc.

4. Postprocessing (e.g., remove outliers)

Other methods: probabilistic note-based, matrix factorization

7/14

Page 8: Polyphonic Transcription Bruno Angeles McGill University - Schulich School of Music MUMT-621 Fall 2009 1/14

Evaluation of Transcription Methods

NC: number of correct events detected

ND: total number of events detected

N: actual number of events

Precision: P = NC / ND

Recall: R = NC / N

F–Measure: F = 2RP/(R+P) = 2 NC / (N + ND)

8/14

Page 9: Polyphonic Transcription Bruno Angeles McGill University - Schulich School of Music MUMT-621 Fall 2009 1/14

Evaluation of Transcription Methods

Gillet and Richard (2005) 2 drummers Different musical

sequences

Celtic & Groove 5/4:Many ghost notes

Table 4 of Gillet and Richard (2005)9/14

Page 10: Polyphonic Transcription Bruno Angeles McGill University - Schulich School of Music MUMT-621 Fall 2009 1/14

Evaluation of Transcription Methods

Ryynänen and Klapuri (2005):R ≈ P ≈ 40%Multiple instrumentsMultiple concurrent notes

Hainsworth and Macleod (2005):P = 78.7%Bass guitarPolyphonic context

Gillet and Richard (2005):R ≈ P ≈ 84%Drums

10/14Ryynänen and Klapuri (2005):

Page 11: Polyphonic Transcription Bruno Angeles McGill University - Schulich School of Music MUMT-621 Fall 2009 1/14

Conclusion

Analogy with speech recognition The problem has not yet been solved Still easier for humans to do – although complex Combine with source separation?

11/14

Page 12: Polyphonic Transcription Bruno Angeles McGill University - Schulich School of Music MUMT-621 Fall 2009 1/14

Thank you!

12/14

Page 13: Polyphonic Transcription Bruno Angeles McGill University - Schulich School of Music MUMT-621 Fall 2009 1/14

ReferencesBruno, I., S. L. Monni, and P. Nesi. 2003. Automatic music transcription supporting different instruments. In

Proceedings of the Third International Conference on Web Delivering of Music. Leeds, UK. 37–44.

Cemgil, A. T., B. Kappen, and D. Barber. 2003. Generative model based polyphonic music transcription. In Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. New Paltz, NY, USA. 181–4.

Gillet, O. K., and G. Richard. 2005. Drum Track Transcription of Polyphonic Music Using Noise Subspace Projection. In Proceedings of the International Conference on Music Information Retrieval. London, UK. 92–99.

Hainsworth, S. W. 2001. Analysis of musical audio for polyphonic transcription. In First Year Ph.D. Report.

Hainsworth, S. W., and M. D. Macleod. 2001. Automatic bass line transcription from polyphonic music. In Proceedings of the International Computer Music Conference. Havana, Cuba.

Hainsworth, S. W., and M. D. Macleod. 2004. The automated music transcription problem. In Cambridge University Engineering Department.

Klapuri, A. 2004. Signal processing methods for the automatic transcription of music. In Ph.D. Dissertation.

13/14

Page 14: Polyphonic Transcription Bruno Angeles McGill University - Schulich School of Music MUMT-621 Fall 2009 1/14

ReferencesLidy, T., A. Rauber, A. Pertusa, and J. M. Iñesta. 2007. Improving genre classification by combination of audio and

symbolic descriptors using a transcription system. In Proceedings of the International Conference on Music Information Retrieval. Vienna, Austria. 61–6.

Marolt, M. 2001. SONIC: Transcription of polyphonic piano music with neural networks. In Proceedings of the Workshop on Current Research Directions in Computer Music. Barcelona, Spain. 217–24.

Nichols, E., and C. Raphael. 2007. Automatic transcription of music audio through continuous parameter tracking. In Proceedings of the International Conference on Music Information Retrieval. Vienna, Austria. 387–92.

Niedermayer, B. 2008. Non-negative matrix division for the automatic transcription of polyphonic music. In Proceedings of the International Conference on Music Information Retrieval. Philadelphia, PA, USA. 544–9.

Paulus, J. 2006. Acoustic modelling of drum sounds with Hidden Markov Models for music transcription. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing. Toulouse, France.

Pertusa, A., and J. M. Iñesta. 2003. Polyphonic music transcription through dynamic networks and spectral pattern identification. In IAPR International Workshop on Artificial Neural Networks in Pattern Recognition. 19–25.

Ryynänen, M., and A. Klapuri. 2006. Transcription of the singing melody in polyphonic music. In Proceedings of the International Conference on Music Information Retrieval. Victoria, BC, Canada.

14/14