Download Separation of overlapping impulsive sounds by bandwise noise interpolation
The task of extracting harmonic content of multiple pitched sources from a mono audio mix has been investigated on several occasions [1, 2, 3, 4]. However, most pitched notes contain an inharmonic component, which is an important perceptual attribute of the sound. This content is usually not dealt with during separation. It would also be interesting in its own right to develop separation techniques for extracting percussive sounds for polyphonic mixes. This paper describes an attempt at separating overlapping impulsive content of multiple sources from a mono mix. The method uses an interpolation within individual frequency bands of the decaying noise envelope of each source across overlapping sections with other sources. Three analysis methods determining the distribution of these bands were tested: the DFT followed by processing in Bark bands, the discrete wavelet transform (DWT), and the dyadic wavelet packet transform (DWPT).
Download Playing cylinders of mechanical organs with an optical reader
This study presents an experimental setup designed to read, by means of optical techniques, the music inscribed on automatic organ cylinders. We describe the acquisition principle based on images taken by a CCD linear camera, and the various digital signal processing techniques employed to retrieve the music from the images. Preliminary results show that this original method is a relevant choice, since on our test cylinder about 90 % of the notes are correctly identified, with only 14 % of false alarms. However, for realistic estimates of the actual music, some improvements are still necessary, both in the experimental setup and in the way individual note positions are converted into music.
Download A robust algorithm for partial tracking of music signals
In this paper we propose a novel approach for tracking of partials in music signals based on a robust Kalman filter. Our tracker is based on a regularized least-squares approach that is designed to minimize the worst-possible regularized residual norm over the class of admissible uncertainties at each iteration. We introduce a set of state-space models for our signals based on the evolution of frequency and amplitude in different classes of musical instruments. These prior models are used to estimate future values of partial tracks in successive time frames of our spectral data. Here, the parameters of evolution models are treated as bounded uncertainties and our tracker can robustly track partials in all frequency regions. Unlike the conventional Kalman tracker, performance of this tracker is not influenced by the magnified track variations in higher frequencies. This tracker promises an improved performance over conventional Kalman tracker while preserving its good properties and superiority over existing methodologies.
Download On the Definition of Musical Notes from Pitch Tracks for Melody Detection in Polyphonic Recordings
The present study addresses the problem of defining musical notes from pitch tracks, in the context of a system for melody detection in polyphonic musical signals. This is an important issue for melody transcription, as well as melody-based music information retrieval. Previous work in the area tackled mainly the extraction of melodic pitch lines, without explicit determination of musical notes. Therefore, in this paper we propose an approach for the creation of musical notes based on a two-stage segmentation of pitch tracks. In the first step, frequency-based segmentation is carried out through the detection of frequency variations in pitch tracks. In the second stage, salience-based segmentation is performed so as to split consecutive notes with equal value, by making use of salience minima and note onsets.
Download Finding intensities and temporal characteristics in piano music
Timing and dynamics are two important factors in music performance. Research on dynamics-related issues is comparatively rare because data on dynamics is difficult to obtain from music performance. Nevertheless, research of this kind is vital to the understanding of music performance and here we are investigating ways to identify the intensities of individual notes in a mixture of simultaneous notes. The approach to this problem is divided into two stages. The first stage consists of obtaining the magnitude of the fundamental frequency of an individual note to determine its intensity out of a mixture of simultaneous notes, on condition that the corresponding pitches of which are given. Two simultaneous notes one or two octaves apart are also included in this study. The second stage consists of generating, artificially, a mixture of notes from a recorded single-note database, subsequently referred to as “estimated mixture”. The time lag between individual notes in the estimated mixture is adjusted, so that the residual between which and the input comes to a minimum. The proposed method is verified with real data and the result is satisfactory.
Download Generating similarity-based playlists using travleling salesman algorithms
When using a mobile music player en-route, usually only little attention can be paid to its handling. Nonetheless it is desirable that all music stored in the device can be accessed quickly, and that tracks played in a sequence should match up. In this paper, we present an approach to satisfy these constraints: a playlist containing all tracks stored in the music player is generated such that in average, consecutive pieces are maximally similar. This is achieved by applying a Traveling Salesman algorithm to the pieces, using timbral similarities as the distances. The generated playlist is linear and circular, thus the whole collection can easily be browsed with only one input wheel. When a chosen track finishes playing, the player advances to the consecutive tracks in the playlist, generally playing tracks similar to the chosen track. This behavior could be a favorable alternative to the wellknown shuffle function that most current devices – such as the iPod shuffle, for example – have. We evaluate the fitness of four different Traveling Salesman algorithms for this purpose. Evaluated aspects were runtime, the length of the resulting route, and the genre distribution entropy. We implemented a Java applet to demonstrate the application and its usability.
Download Comparing synthetic and real templates for dynamic time warping to locate partial envelope features
In this paper we compare the performance of a number of different templates for the purposes of split point identification of various clarinet envelopes. These templates were generated with AttackDecay-Sustain-Release (ADSR) descriptions commonly used in musical synthesis, along with a set of real templates obtained using k-means clustering of manually prepared test data. The goodness of fit of the templates to the data was evaluated using the Dynamic Time Warping (DTW) cost function, and by evaluating the square of the distance of the identified split points to the manually identified split points in the test data. It was found that the best templates for split point identification were the synthetic templates followed by the real templates having a sharp attack and release characteristic, as is characteristic of the clarinet envelope.
Download Event Synchronous Wavelet Transform Approach to the Extraction of Musical Music Thumbnails
Fast browsing of digital collections of music would largely benefit from the availability of representative audio excerpts of the pieces. Similar to their visual counterparts found in digital photo albums, music thumbnails should offer a comprehensive listening experience while requiring a limited storage space or communication data rate. The approach to the generation of musical thumbnails of music proposed in this paper is based on an application of the PitchSynchronous Wavelet Transform, where the “pitch” is tuned to the elementary measure of the piece. The music thumbnail is encoded by the low rate coefficient sequence pertaining to the scaling residue of the transform. The scaling component represents the pseudo-periodic trend of the piece over several measures. Due to pseudo-periodicity, the time duration of the thumbnail can be arbitrarily extended in listening with no audible artifacts.
Download Improved adjustable boundary condition for the 2-D finite-difference time-domain mesh
The digital waveguide (DWG) mesh is a method for simulating wave propagation in multiple dimensions. Boundary conditions are needed for modeling changes in wave propagation media such as walls and furniture in a room or boundaries of a resonating membrane of a musical instrument. The boundary conditions have been solved for a one-dimensional DWG structure, but there is no known exact solution for the multi-dimensional mesh. In this work, a new boundary structure is introduced for modeling reflection coefficient values −1 ≤ r ≤ 1 in two dimensions. The new method gives remarkably more accurate results than the earlier approximations, especially at the low absolute values of r. At incident angles of Θ < 60o , the absolute error of reflection coefficient r is below 0.1 at frequencies 0.004 < f < 0.222 relative to the sampling frequency and at 60o ≤ Θ ≤ 80o the same result is reached at 0.005 < f < 0.114.
Download Sound Synthesis for Nonlinear Plates
In this paper, a simple finite difference scheme for a rectangular dynamic nonlinear plate, under free boundary conditions is presented. The algorithm is straightforward to program, and is capable of reproducing, to a first approximation, the behaviour of various percussion instruments whose timbre depends crucially on nonlinear effects (due to high-speed strikes), including transient pitch glides and the buildup of high-frequency energy. Though computationally intensive, algorithms such as that presented here promise more faithful sound synthesis and, as with all physical model inspired synthesis algorithms, require the specification of only a few, physically meaningful parameters. Full details of the algorithm, including the setting of boundary conditions and computational demands are provided. Numerical simulation results are presented.