Download The development of an online course in DSP eartraining The authors present a collaborative effort on establishing an online course in DSP eartraining. The paper reports from a preliminary workshop that covered a large range of topics such as eartraining in music education, terminology for sound characterization, e-learning, automated tutoring, DSP techniques, music examples and audio programming. An initial design of the web application is presented as a rich content database with flexible views to allow customized online presentations. Technical risks have already been mitigated through prototyping.
Download Musical Sound Effects in the SAS Model Spectral models provide general representations of sound in which many audio effects can be performed in a very natural and musically expressive way. Based on additive synthesis, these models control many sinusoidal oscillators via a huge number of model parameters which are only remotely related to musical parameters as perceived by a listener. The Structured Additive Synthesis (SAS) sound model has the flexibility of additive synthesis while addressing this problem. It consists of a complete abstraction of sounds according to only four parameters: amplitude, frequency, color, and warping. Since there is a close correspondence between the SAS model parameters and perception, the control of the audio effects gets simplified. Many effects thus become accessible not only to engineers, but also to musicians and composers. But some effects are impossible to achieve in the SAS model. In fact structuring the sound representation imposes limitations not only on the sounds that can be represented, but also on the effects that can be performed on these sounds. We demonstrate these relations between models and effects for a variety of models from temporal to SAS, going through well-known spectral models.
Download Computer Synthesis Of Bird Songs And Calls Bird songs are fascinating acoustic phenomena. In this paper, we first examine the acoustic mechanisms of sound production in birds and contrast them with their anatomy. Next, we describe the simulation of a one-mass source together with a simple transmission line model for a psittacine bird. In concluding, we discuss future areas for research.
Download The Image-Source Reverberation Model in an N-Dimensional Space The image method is generalized to geometries with an arbitrary number of spatial dimensions. n-dimensional (n-D) acoustics is discussed, and an algorithm for n-D room impulse response calculations is presented. Synthesized room impulse responses (RIRs) from n-D rooms are presented. RIR characteristics are discussed, and computational considerations are examined.
Download Stable Structures for Nonlinear Biquad Filters Biquad filters are a common tool for filter design. In this writing,
we develop two structures for creating biquad filters with nonlinear elements. We provide conditions for the guaranteed stability of
the nonlinear filters, and derive expressions for instantaneous pole
analysis. Finally, we examine example filters built with these nonlinear structures, and show how the first nonlinear structure can be
used in the context of analog modelling.
Download Time Varying Frequency Warping: Results And Experiments Dispersive tapped delay lines are attractive structures for altering the frequency content of a signal. In previous papers we showed that in the case of a homogeneous line with first order all-pass sections the signal formed by the output samples of the chain of delays at a given time is equivalent to compute the Laguerre transform of the input signal. However, most musical signals require a time-varying frequency modification in order to be properly processed. Vibrato in musical instruments or voice intonation in the case of vocal sounds may be modeled as small and slow pitch variations. Simulations of these effects require techniques for time- varying pitch and/or brightness modification that are very useful for sound processing. In our experiments the basis for time-varying frequency warping is a time-varying version of the Laguerre transformation. The corre- sponding implementation structure is obtained as a dispersive tapped delay line, where each of the frequency dependent delay element has its own phase response. Thus, time-varying warping results in a space-varying, inhomogeneous, propagation structure. We show that time-varying frequency warping may be associated to expansion over biorthogonal sets generalizing the discrete Laguerre basis. Slow time-varying characteristics lead to slowly varying parameter sequences. The corresponding sound transformation does not suffer from discontinuities typical of delay lines based on unit delays.
Download Audio Nonlinear Modeling through Hyperbolic Tangent Functionals In the present contribution we present the preliminary results of a black box nonlinear system (NLS) modeling. The NLS is composed by a nonlinear sigmoid-type input-output relationship (NLTF) followed by a linear system (LTI), as in a Hammerstein nonlinear system. Here, the used NLTF is derived from a deformation of the Hyperbolic Tangent power expansion. The advantage of using the hyperbolic tangent function is that nonlinearity depends on the linear and cubic terms that measure curvature (and thus nonlinearity) of the transfer function. The hyperbolic tangent model is extended to other types of nonlinear systems by expanding the nonlinear system in linear and increasingly nonlinear contributions, where the expansion parameters are deformed to enhance or suppress specific nonlinear modes of the expansion. Simulations were performed using Matlab 2012a. The preliminary results show fairly good agreement between the system obtained by parametric inference and a reference system, with mean square error (MSE)=0.035.
Download An Interdisciplinary Approach to Audio Effect Classification The aim of this paper is to propose an interdisciplinary classification of digital audio effects to facilitate communication and collaborations between DSP programmers, sound engineers, composers, performers and musicologists. After reviewing classifications reflecting technological, technical and perceptual points of view, we introduce a transverse classification to link disciplinespecific classifications into a single network containing various layers of descriptors, ranging from low-level features to high-level features. Simple tools using the interdisciplinary classification are introduced to facilitate the navigation between effects, underlying techniques, perceptual attributes and semantic descriptors. Finally, concluding remarks on implications for teaching purposes and for the development of audio effects user interfaces based on perceptual features rather than technical parameters are presented.
Download Sound texture synthesis using Convolutional Neural Networks The following article introduces a new parametric synthesis algorithm for sound textures inspired by existing methods used for visual textures. Using a 2D Convolutional Neural Network (CNN), a sound signal is modified until the temporal cross-correlations of the feature maps of its log-spectrogram resemble those of a target texture. We show that the resulting synthesized sound signal is both different from the original and of high quality, while being able to reproduce singular events appearing in the original. This process is performed in the time domain, discarding the harmful phase recovery step which usually concludes synthesis performed in the time-frequency domain. It is also straightforward and flexible, as it does not require any fine tuning between several losses when synthesizing diverse sound textures. Synthesized spectrograms and sound signals are showcased, and a way of extending the synthesis in order to produce a sound of any length is also presented. We also discuss the choice of CNN, border effects in our synthesized signals and possible ways of modifying the algorithm in order to improve its current long computation time.