Note: This bibliographic page is archived and will no longer be updated. For an up-to-date list of publications from the Music Technology Group see the Publications list .

End-to-End Sound Source Separation Conditioned On Instrument Labels

Title End-to-End Sound Source Separation Conditioned On Instrument Labels
Publication Type Conference Paper
Year of Publication 2019
Conference Name 2019 International Conference on Acoustics, Speech, and Signal Processing
Authors Slizovskaia, O. , Kim L. , Haro G. , & Gómez E.
Conference Start Date 12/05/2019
Publisher IEEE
Conference Location Brighton, United Kingdom
Abstract Can we perform an end-to-end sound source separation with a variable number of sources using a deep learning model? This paper presents an extension of the Wave-U-Net model which allows end-to-end monaural source separation with a non-fixed number of sources. Furthermore, we propose multiplicative conditioning with instrument labels at the bottleneck of the Wave-U-Net and show its effect on the separation results. This approach can be further extended to other types of conditioning such as audio-visual and score-informed source separation.
preprint/postprint document https://arxiv.org/abs/1811.01850