Download A Robust Stochastic Approximation Method for Crosstalk Cancellation Crosstalk cancellation serves as an important role in binaural signals playback through loudspeakers, which reproduce a particular auditory scene to the listener’s ears. In practice, due to either the listener’s head movement or rotation, etc, the actual transfer function matrix will differ from the design matrix, which results in deterioration in the performance of crosstalk cancellation. Crosstalk cancellation system (CCS) is very non-robust to these perturbations. Generally, in order to improve the robustness of CCS, several pairs of loudspeakers using a multi-band approach processing band-passed content to appropriately spaced loudspeakers are needed. In this paper, by means of assumed stochastic analysis, a stochastic robust approximation method based on random perturbation matrix modeling the variations of the transfer function matrix is introduced and evaluated. Under free-field condition, simulation results demonstrate the effectiveness of the proposed method.
Download HRTF Spatial Upsampling in the Spherical Harmonics Domain Employing a Generative Adversarial Network A Head-Related Transfer Function (HRTF) is able to capture alterations a sound wave undergoes from its source before it reaches the entrances of a listener’s left and right ear canals, and is imperative for creating immersive experiences in virtual and augmented reality (VR/AR). Nevertheless, creating personalized HRTFs demands sophisticated equipment and is hindered by time-consuming data acquisition processes. To counteract these challenges, various techniques for HRTF interpolation and up-sampling have been proposed. This paper illustrates how Generative Adversarial Networks (GANs) can be applied to HRTF data upsampling in the spherical harmonics domain. We propose using Autoencoding Generative Adversarial Networks (AE-GAN) to upsample lowdegree spherical harmonics coefficients and get a more accurate representation of the full HRTF set. The proposed method is benchmarked against two baselines: barycentric interpolation and HRTF selection. Results from log-spectral distortion (LSD) evaluation suggest that the proposed AE-GAN has significant potential for upsampling very sparse HRTFs, achieving 17% improvement over baseline methods.