Note: This bibliographic page is archived and will no longer be updated. For an up-to-date list of publications from the Music Technology Group, see the Publications list.

Score-Informed Source Separation for Multichannel Orchestral Recordings

Title Score-Informed Source Separation for Multichannel Orchestral Recordings
Publication Type Journal Article
Year of Publication 2016
Authors Miron, M., Carabias-Orti, J. J., Bosch, J., Gómez, E., & Janer, J.
Journal Title Journal of Electrical and Computer Engineering
Volume 2016
Journal Date 12/2016
Abstract This paper proposes a system for score-informed audio source separation for multichannel orchestral recordings. The orchestral music repertoire relies on the existence of scores. Thus, a reliable separation requires a good alignment of the score with the audio of the performance. To that extent, automatic score alignment methods are reliable when allowing a tolerance window around the actual onset and offset. Moreover, several factors increase the difficulty of our task: a high reverberant image, large ensembles having rich polyphony, and a large variety of instruments recorded within a distant-microphone setup. To solve these problems, we design context-specific methods such as the refinement of score-following output in order to obtain a more precise alignment. Moreover, we extend a close-microphone separation framework to deal with the distant-microphone orchestral recordings. Then, we propose the first open evaluation dataset in this musical context, including annotations of the notes played by multiple instruments from an orchestral ensemble. The evaluation aims at analyzing the interactions of important parts of the separation framework on the quality of separation. Results show that we are able to align the original score with the audio of the performance and separate the sources corresponding to the instrument sections.
preprint/postprint document https://www.hindawi.com/journals/jece/2016/8363507/
Final publication 10.1155/2016/8363507
Additional material:

The dataset used in this paper can be found at the web page