Download Sinusoid Extraction and Salience Function Design for Predominant Melody Estimation In this paper we evaluate some of the alternative methods commonly applied in the first stages of the signal processing chain of automatic melody extraction systems. Namely, the first two stages are studied – the extraction of sinusoidal components and the computation of a time-pitch salience function, with the goal of determining the benefits and caveats of each approach under the specific context of predominant melody estimation. The approaches are evaluated on a data-set of polyphonic music containing several musical genres with different singing/playing styles, using metrics specifically designed for measuring the usefulness of each step for melody extraction. The results suggest that equal loudness filtering and frequency/amplitude correction methods provide significant improvements, whilst using a multi-resolution spectral transform results in only a marginal improvement compared to the standard STFT. The effect of key parameters in the computation of the salience function is also studied and discussed.
Download Generalized Reassignment with an Adaptive Polynomial-Phase Fourier Kernel for the Estimation of Non-Stationary Sinusoidal Parameters This paper describes an improvement of the generalized reassignment method for estimating the parameters of a modulated real sinusoid. The main disadvantage of this method is decreased accuracy for high log-amplitude and/or frequency changes. One of the reasons for such accuracy deterioration stems from the use of the Fourier transform. Fourier transform belongs to a more general family of integral transforms and can be defined as an integral transform using a Fourier kernel function - a stationary complex sinusoid. A correlation between the Fourier kernel function and a non-stationary sinusoid decreases as the modulation of the sinusoid increases, ultimately causing the parameter estimation deterioration. In this paper, the generalized reassignment is reformulated for use with an arbitrary kernel. Specifically, an adaptive polynomial-phase Fourier kernel is proposed. It is shown that such an algorithm needs the parameter estimates from the original generalized reassignment method and that it improves the Signalto-Residual ratio (SRR) in the non-noisy cases. The drawbacks concerning the initial conditions and ways of avoiding a close-tosingular system of linear equations are discussed.