Download Hierarchical Organization and Visualization of Drum Sample Libraries
Drum samples are an important ingredient for many styles of music. Large libraries of drum sounds are readily available. However, their value is limited by the ways in which users can explore them to retrieve sounds. Available organization schemes rely on cumbersome manual classification. In this paper, we present a new approach for automatically structuring and visualizing large sample libraries through audio signal analysis. In particular, we present a hierarchical user interface for efficient exploration and retrieval based on a computational model of similarity and self-organizing maps.
Download Spatial Auditory Displays - A study on the use of virtual audio environments as interfaces for users with visual disabilities
This paper presents the work on a prototype spatial auditory display. Using high-definition audio rendering a sample application was presented to a mixed group of users with visual disabilities and normal sighted users. The evaluation of the prototype provided insights into how effective spatial presentation of sound can be in terms of human-computer interaction (HCI). It showed that typical applications with the most common interaction tasks like menus, text input and dialogs can be presented very effectively using spatial audio. It also revealed that there is no significant difference in effectiveness between normal sighted and visually impaired users. We believe that spatial auditory displays are capable to provide the visually impaired and blind access to modern information technologies in a more efficient way than common technologies and that they will be inevitable for multimodal displays in future applications.
Download Visualization of Sound as a Control Interface
We here introduce the opportunity of using visualization of sound as a control interface, for artistic live performance as well as for new digital audio effects developments. Two different approaches are exposed. The first access consists in using video matrixes for mapping the variables parameters of sound processing, with the coordinates of any controller in a 2D plane or 3D space. The second access proposes a visualization of sound that modifies sound data by processing the data of the image itself with its own graphical properties. Exploring this kind of « transducting » relation between visual and audio may be interesting for artistic creation domain using virtual surroundings; it may cause an interest for the real time digital audio, for audiovisual mixing and new interfaces for sound design. Besides, it points the opportunity of developing 3D control interfaces for audio and visual processes.
Download Virtual rooms recreation for Wave Field Synthesis
Advanced multichannel sound systems such as Wave Field Synthesis (WFS) allow to recreate spatial wide sound scenes of sources. The recreation of the illusion of a 3D natural and realistic sound scene can be achieved by means of virtual rooms where the wave field is simulated. Such wave field is used as a source of information for the convolution of WFS sound sources with extrapolated impulsive responses in these virtual rooms. To obtain the needed plane waves for auralization, a complete description of the sound field is needed, including an accurate knowledge of the particle velocity. In this paper, virtual rooms are simulated by means of Finite-Differences Time Domain method. This method provides a complete solution of the sound field variables in a wide frequency band and can be used to produce both the impulsive responses of pressure and particle velocity for plane wave decomposition, prior to auralization. To illustrate its applicability, a set of rooms consisting of a typical auditorium room, a cinema and a perfect cube are shown and evaluated.
Download Time and Frequency Domain Room Compensation applied to Wave Field Synthesis
In sound rendering systems using loudspeakers, the listening room adds echoes not considered by the reproduction system, thus deteriorating the rendered audio signal. Specifically, Wave Field Synthesis is a 3D audio reproduction system, which allows synthesizing a realistic sound field in a wide area by using arrays of loudspeakers. This paper proposes a room compensation approach based on a multichannel inverse filter bank calculated to compensate the room effects at selected points within the listening area. Time domain and frequency domain algorithms are proposed to accurately compute the bank of inverse filters. A comparative study between these algorithms by means of laboratory experiments is presented.
Download Source separation for microphone arrays using multichannel conjugate gradient techniques
This paper proposes a new scheme to improve the source separation problem aimed to microphone array applications like WFS based teleconference systems. A multichannel, sub-band approach to reduce computational complexity is presented. Also, instead of using the LMS adaptive algorithm, a new system based on hybrid Conjugate Gradient-nLMS techniques is developed to accelerate the convergence time. This adaptive algorithm is controlled by a voice activity detector block that basically detects double talk situations and freezes the adaptation process to avoid the appearance of sound artifacts which may cause a significant degradation of the recovered signals and have a great impact in the quality of the full system.
Download Alternative analysis-resynthesis approaches for timescale, frequency and other transformations of musical signals
This article presents new spectral analysis-synthesis approaches to musical signal transformation. The analysis methods presented here involve the use of a superior quality technique of frequency estimation, the Instantaneous Frequency Distribution (IFD), and partial tracking. We discuss the theory behind the IFD, comparing it to other existing methods. The partial tracking analysis employed in this process is explained fully. This is followed by a look into the three resynthesis methods proposed by this work, based on different approaches to additive synthesis. A number of transformations of musical signals are proposed to take advantage of the analysis-synthesis techniques. Performance details and specific aspects of this implementation are discussed. This is complemented by a look at some of the results of these methods in the time-stretching of audio signals, where they will be shown to perform better than many of the currently available techniques.
Download A Generalized Polynomial and Sinusoidal Model for Partial Tracking and Time Stretching
In this article, we introduce a new generalized model based on polynomials and sinusoids for partial tracking and time stretching. Nowadays, most partial tracking algorithms are based on the McAulay-Quatieri approach and use polynomials for phase, frequency, and amplitude tracks. Some sinusoidal approaches have also been proved to work in certain conditions. We will present here an unified model using both approaches, which will allow more flexible partial tracking and time stretching.
Download Efficient spectral envelope estimation and its application to pitch shifting and envelope preservation
In this article the estimation of the spectral envelope of sound signals is addressed. The intended application for the developed algorithm is pitch shifting with preservation of the spectral envelope in the phase vocoder. As a first step the different existing envelope estimation algorithms are investigated and their specific properties discussed. As the most promising algorithm the cepstrum based iterative true envelope estimator is selected. By means of controlled sub-sampling of the log amplitude spectrum and by means of a simple step size control for the iterative algorithm the run time of the algorithm can be decreased by a factor of 2.5-11. As a remedy for the ringing effects in the the spectral envelope that are due to the rectangular filter used for spectral smoothing we propose the use of a Hamming window as smoothing filter. The resulting implementation of the algorithm has slightly increased computational complexity compared to the standard LPC algorithm but offers significantly improved control over the envelope characteristics. The application of the true envelope estimator in a pitch shifting application is investigated. The main problems for pitch shifting with envelope preservation in a phase vocoder are identified and a simple yet efficient remedy is proposed.
Download Sound-System Design for a Professional Full-Flight Simulator
In this paper, we present a sound system to be integrated in an accredited realistic full-flight simulator, used for the training of airline pilots. We discuss the design and implementation of a corresponding real-time signal-processing software providing threedimensional audio reproduction of the acoustic events on a flight deck. Here, the emphasis is on an aircraft of a specific type. We address issues of suitable data acquisition methods, and, most importantly, of functional signal analysis and synthesis techniques.