This paper presents a system for morphing between audio recordings in a continuous parameter space.
The proposed approach
combines matrix decompositions used for audio source separation with displacement interpolation enabled by 1D optimal transport. By interpolating the spectral components obtained using nonnegative matrix factorization of the source and target signals, the
system allows varying the timbre of a sound in real time, while
maintaining its temporal structure. Using harmonic / percussive
source separation as a pre-processing step, the system affords more
detailed control of the interpolation in perceptually meaningful dimensions.