Download Water Bottle Synthesis With Modal Signal Processing
We present a method for accurately synthesizing the acoustic response of a water bottle using modal signal processing. We start with extensive measurements of two water bottles with considerations for how the level of water inside the bottles, the area covered by stickers attached to the exterior of the bottles, and the method of striking the bottles affect their sound. We perform modal analysis of these measurements and implement a real-time modal water bottle synthesizer.
Download Audio Morphing Using Matrix Decomposition and Optimal Transport
This paper presents a system for morphing between audio recordings in a continuous parameter space. The proposed approach combines matrix decompositions used for audio source separation with displacement interpolation enabled by 1D optimal transport. By interpolating the spectral components obtained using nonnegative matrix factorization of the source and target signals, the system allows varying the timbre of a sound in real time, while maintaining its temporal structure. Using harmonic / percussive source separation as a pre-processing step, the system affords more detailed control of the interpolation in perceptually meaningful dimensions.
Download Advanced Fourier Decomposition for Realistic Drum Synthesis
This paper presents a novel method of analysing drum sounds, demonstrating that this can form the basis of a highly realistic synthesis technique for real-time use. The synthesis method can be viewed as an extension of IFFT synthesis; here we exploit the fact that audio signals can be recovered from solely the real component of their discrete Fourier transform (RDFT). All characteristics of an entire drum sample can therefore be conveniently encoded in a single, real-valued, frequency domain signal. These signals are interpreted, incorporating the physics of the instrument, and modelled to investigate how the perceptual features are encoded. The model was able to synthesize drum sound components in such detail that they could not be distinguished in an ABX test. This method may therefore be capable of outperforming existing synthesis techniques, in terms of realism. Sound examples available here.
Download Efficient Snare-Drum Model for Acoustic Interfaces With Piezoelectric Sensors
This paper describes a computationally efficient synthesis model for snare drum sounds. Its parameters can be modulated at audio rate while being played. The input to the model is an acoustic excitation signal which carries spectral information to color the output sound. This makes it suitable for acoustic interfaces – devices which provide excitation signal and control data simultaneously. The presented synthesis model builds up on work done by Miller Puckette and processes audio input from a piezoelectric microphone into a nonlinear reverberator. This paper details a strikingly simple but novel approach on how to make use of the momentary DC offset generated by piezoelectric microphones when pressed to simulate the changes in drumhead tension. This technique is especially of interest for interfaces without pressure sensing capabilities. In the design process we pursued an experimental approach rather than a purely mathematical. Implementations of the synthesis model are provided for Pure Data and FAUST as open source.
Download Adversarial Synthesis of Drum Sounds
Recent advancements in generative audio synthesis have allowed for the development of creative tools for generation and manipulation of audio. In this paper, a strategy is proposed for the synthesis of drum sounds using generative adversarial networks (GANs). The system is based on a conditional Wasserstein GAN, which learns the underlying probability distribution of a dataset compiled of labeled drum sounds. Labels are used to condition the system on an integer value that can be used to generate audio with the desired characteristics. Synthesis is controlled by an input latent vector that enables continuous exploration and interpolation of generated waveforms. Additionally we experiment with a training method that progressively learns to generate audio at different temporal resolutions. We present our results and discuss the benefits of generating audio with GANs along with sound examples and demonstrations.
Download Complementary N-Gon Waves and Shuffled Samples Noise
This paper introduces complementary n-gon waves and the shuffled samples noise effect. N-gon waves retain angles of the regular polygons and star polygons of which they are derived from in the waveform itself. N-gon waves are researched by the author since 2000 and were introduced to the public at ICMC|SMC in 2014. Complementary n-gon waves consist of an n-gon wave and a complementary angular wave. The complementary angular wave introduced in this paper complements an n-gon wave so that the two waveforms can be used to reconstruct the polygon of which the waveforms were derived from. If it is derived from a star polygon, it is not an n-gon wave and has its own characteristics. Investigations into how geometry, audio, visual and perception are related led to experiments with complementary n-gon waves and a shuffle algorithm. It is possible to reconstruct a digitised geometric shape from its shuffled samples and visualise the geometric shape with shuffled samples noise signals on a digital display device or also, within some limitations, on an oscilloscope in X-Y mode. This paper focuses on the description of discrete complementary n-gon waves and how a Fisher-Yates shuffle algorithm was applied to these waveforms and used to create the shuffled samples noise effect. In addition, some of the timbral and spatial characteristics of complementary n-gon waves and shuffled samples noise are outlined and audiovisual applications of these waveforms briefly discussed.
Download Accelerating Matching Pursuit for Multiple Time-Frequency Dictionaries
Matching pursuit (MP) algorithms are widely used greedy methods to find K-sparse signal approximations in redundant dictionaries. We present an acceleration technique and an implementation of the matching pursuit algorithm acting on a multi-Gabor dictionary, i.e., a concatenation of several Gabor-type time-frequency dictionaries, consisting of translations and modulations of possibly different windows, time- and frequency-shift parameters. The proposed acceleration is based on pre-computing and thresholding inner products between atoms and on updating the residual directly in the coefficient domain, i.e., without the round-trip to the signal domain. Previously, coefficient-domain residual updates have been dismissed as having prohibitive memory requirements. By introducing an approximate update step, we can overcome this restriction and greatly improve the performance of matching pursuit at a modest cost in terms of approximation quality per selected atom. An implementation in C with Matlab and GNU Octave interfaces is available, outperforming the standard Matching Pursuit Toolkit (MPTK) by a factor of 3.5 to 70 in the tested conditions. Additionally, we provide experimental results illustrating the convergence of the implementation.
Download A String in a Room: Mixed-Dimensional Transfer Function Models for Sound Synthesis
Physical accuracy of virtual acoustics receives increasing attention due to renewed interest in virtual and augmented reality applications. So far, the modeling of vibrating objects as point sources is a common simplification which neglects effects caused by their spatial extent. In this contribution, we propose a technique for the interconnection of a distributed source to a room model, based on a modal representation of source and room. In particular, we derive a connection matrix that describes the coupling between the modes of the source and the room modes in an analytical form. Therefore, we consider the example of a string that is oscillating in a room. Both, room and string rely on well established physical descriptions that are modeled in terms of transfer functions. The derived connection of string and room defines the coupling between the characteristic string and room modes. The proposed structure is analyzed by numerical evaluations and sound examples on the supplementary website.
Download A Finite Difference Model for Articulated Slide-String Simulation
Slide-string instruments allow continuous control of pitch by articulation with a slide object whose position of contact with the string is time-varying. This paper presents a method for simulation of such articulation. Taking into account sensing and musical practice considerations, an appropriate physical model configuration is determined, which is then formulated in numerical form using a finite difference approach. The model simulates the attachment and detachment phases of slide articulation which generally involve rattling, while finger damping is modelled in a more phenomenological manner as a regionally induced time-varying damping. A stability bound for the numerical model is provided via energy analysis, which also reveals the driving power contributions of the separate articulatory sources. The approach is exemplified with simulations of slide articulatory gestures that involve glissando, vibrato and finger damping.
Download A Power-Balanced Dynamic Model of Ferromagnetic Coils
This paper proposes a new macroscopic physical model of ferromagnetic coils used in audio circuits. To account for realistic saturation and hysteretic phenomena, this model combines statistical physics results, measurement-driven refinements and portHamiltonian formulations that guarantee passivity, thermodynamic consistency and composability according to both electric and thermal ports. As an illustration, the model is used to simulate a passive high-pass filter. Different types of audio inputs are considered and simulations are compared to measurements.