Justin Salamon, Prem Seetharaman (Implementation Author), Ethan Manilow (Implementation Author) New York University, Northwestern University [email protected]
- is_blind: yes
- additional_training_data: no
This is the nussl implementation of masking based on pitch tracking. Pitch is
estimated with the Melodia pitch tracker from Salamon & Gómez. The estimated
pitch is then used to build a mask that is applied to the mixture. The abstract
for Melodia is below:
We present a novel system for the automatic extraction of the main melody from polyphonic music recordings. Our approach is based on the creation and characterization of pitch contours, time continuous sequences of pitch candidates grouped using auditory streaming cues. We define a set of contour characteristics and show that by studying their distributions we can devise rules to distinguish between melodic and non-melodic contours. This leads to the development of new voicing detection, octave error minimization and melody selection techniques. A comparative evaluation of the proposed approach shows that it outperforms current state-of-the-art melody extraction systems in terms of overall accuracy. Further evaluation of the algorithm is provided in the form of a qualitative error analysis and the study of the effect of key parameters and algorithmic components on system performance. Finally, we conduct a glass ceiling analysis to study the current limitations of the method, and possible directions for future work are proposed.
- Salamon, Justin, and Emilia Gómez. "Melody extraction from polyphonic music signals using pitch contour characteristics." IEEE Transactions on Audio, Speech, and Language Processing 20.6 (2012): 1759-1770.
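The step from pitch to mask described above can be illustrated with a minimal sketch. It assumes a simple harmonic-comb approach: for each voiced frame, the bins near integer multiples of the estimated pitch are kept. The source does not state how the nussl implementation builds its mask, so this is an illustration under that assumption, not the actual method. The function names `harmonic_mask` and `apply_mask` and the parameters `n_harmonics` and `tolerance_hz` are hypothetical, and the pitch track is taken as given (for example, as produced by a pitch tracker such as Melodia).

```python
import numpy as np


def harmonic_mask(f0_hz, n_freq_bins, sample_rate, n_harmonics=20, tolerance_hz=40.0):
    """Build a binary mask of shape (n_freq_bins, n_frames) from a pitch track.

    f0_hz holds one pitch estimate per frame; values <= 0 mark unvoiced frames.
    All parameter names and defaults here are illustrative assumptions.
    """
    f0_hz = np.asarray(f0_hz, dtype=float)
    nyquist = sample_rate / 2.0
    # Centre frequency of each STFT bin, from 0 Hz up to Nyquist.
    bin_freqs = np.linspace(0.0, nyquist, n_freq_bins)
    mask = np.zeros((n_freq_bins, f0_hz.size), dtype=bool)

    for t, f0 in enumerate(f0_hz):
        if f0 <= 0:
            continue  # unvoiced frame: nothing is assigned to the melody
        harmonics = f0 * np.arange(1, n_harmonics + 1)
        harmonics = harmonics[harmonics <= nyquist]
        # A bin is kept if it lies within tolerance_hz of any harmonic.
        distance = np.abs(bin_freqs[:, None] - harmonics[None, :])
        mask[:, t] = (distance <= tolerance_hz).any(axis=1)
    return mask


def apply_mask(mixture_stft, mask):
    """Split a complex mixture STFT into melody and accompaniment estimates."""
    melody = mixture_stft * mask
    accompaniment = mixture_stft * ~mask
    return melody, accompaniment


# Usage with a toy two-frame pitch track: one unvoiced frame, one frame at 440 Hz.
rng = np.random.default_rng(0)
mixture = rng.standard_normal((1025, 2)) + 1j * rng.standard_normal((1025, 2))
mask = harmonic_mask([0.0, 440.0], n_freq_bins=1025, sample_rate=44100)
melody, accompaniment = apply_mask(mixture, mask)
```

Because the mask is binary, the two estimates sum back to the mixture. A soft mask would trade that hard assignment for smoother output.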