End-to-End Sound Source Separation Conditioned On Instrument Labels

Olga Slizovskaia; Kim, Leo; Haro, Gloria; Emilia Gómez

Note: This bibliographic page is archived and will no longer be updated. For an up-to-date list of publications from the Music Technology Group see the Publications list .

End-to-End Sound Source Separation Conditioned On Instrument Labels

Title	End-to-End Sound Source Separation Conditioned On Instrument Labels
Publication Type	Conference Paper
Year of Publication	2019
Conference Name	2019 International Conference on Acoustics, Speech, and Signal Processing
Authors	Slizovskaia, O. , Kim L. , Haro G. , & Gómez E.
Conference Start Date	12/05/2019
Publisher	IEEE
Conference Location	Brighton, United Kingdom
Abstract	Can we perform an end-to-end sound source separation with a variable number of sources using a deep learning model? This paper presents an extension of the Wave-U-Net model which allows end-to-end monaural source separation with a non-fixed number of sources. Furthermore, we propose multiplicative conditioning with instrument labels at the bottleneck of the Wave-U-Net and show its effect on the separation results. This approach can be further extended to other types of conditioning such as audio-visual and score-informed source separation.
preprint/postprint document	https://arxiv.org/abs/1811.01850