Source Separation Methods for Orchestral Music: Timbre-Informed and Score-Informed Strategies
Title | Source Separation Methods for Orchestral Music: Timbre-Informed and Score-Informed Strategies |
Publication Type | PhD Thesis |
Year of Publication | 2018 |
University | Universitat Pompeu Fabra, Barcelona |
Authors | Miron, M. |
Advisor | Gomez, E., & Janer, J. |
Academic Department | Department of Information and Communication Technologies |
Number of Pages | 299 |
Date Published | 02/2018 |
City | Barcelona |
Abstract |
Humans are able to distinguish between various sound sources in their environment and selectively attend to specific ones. However, it is a difficult task to teach a computer to automatically separate the acoustic scene into sources and solely focus on specific elements. This signal processing task is commonly known as audio source separation and involves recovering the sources which are mixed together in a combined signal. This thesis is concerned with source separation of Western classical music mixtures, namely orchestral music. Being able to separate the audio corresponding to the instruments allows for interesting applications such as focusing on a particular section in the orchestra or re-creating the experience of a concert in virtual reality. Additionally, the separated instrument tracks can be further analyzed by other music information research algorithms which perform better on these signals than on the audio signal of the mixture.
Music source separation improves if we know which instruments are present in the piece, and if we have the score, e.g. the notes played by each instrument. In fact, the more information we have about a music piece, the more we can restrict our model.
Towards an objective evaluation, in the second part of the thesis we propose an orchestral music dataset accompanied by score annotations and an evaluation methodology which assesses the influence of different parts of the separation framework.
Most of this work follows the research reproducibility principles, inasmuch as the datasets, code, software prototypes, published papers, and project reports are made available along with the necessary instructions. |
Final publication | https://doi.org/10.5281/zenodo.1163675 |
Full Text |
Available on Zenodo. |
Datasets
We propose the PHENICX-Anechoic dataset, which relies on the Aalto orchestral anechoic recordings. We denoised the original recordings and annotated each of the tracks corresponding to the instrument groups.
The Bach10 recordings synthesized with Sibelius can be found in the Zenodo repository.
Code
The note refinement code from part three of the thesis is on GitHub. We do not have the rights to distribute the NMF framework, and note refinement is to some extent integrated with this framework. However, multi-channel score-informed source separation can be computed using the Repovizz website: create a datapack with the datapack designer, upload it, and select the checkbox that computes the source separation. The uploaded datapack will then comprise the separated tracks.
Code and detailed instructions for reproducing the experiments in part four (source separation using deep learning) can be found in the associated GitHub repository. The separation results and the metrics computed with BSS Eval can be found on the Zenodo page. Similarly, for the score-informed version, check the corresponding Zenodo page.
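As a rough illustration of what a separation metric measures, the sketch below computes a plain signal-to-distortion ratio between a reference source and its estimate. This is a simplified assumption for illustration only: it is not the BSS Eval implementation used for the published results, which additionally decomposes the error into interference and artifact terms. The function name `simple_sdr` and the synthetic test signal are hypothetical.

```python
import math


def simple_sdr(reference, estimate):
    """Plain signal-to-distortion ratio in dB.

    Simplified illustration: energy of the reference divided by the
    energy of the difference between reference and estimate. BSS Eval
    uses a more elaborate decomposition of the error.
    """
    if len(reference) != len(estimate):
        raise ValueError("reference and estimate must have the same length")
    signal_energy = sum(r * r for r in reference)
    error_energy = sum((r - e) ** 2 for r, e in zip(reference, estimate))
    if error_energy == 0.0:
        return math.inf  # perfect reconstruction
    if signal_energy == 0.0:
        return -math.inf  # silent reference, non-silent error
    return 10.0 * math.log10(signal_energy / error_energy)


# Hypothetical example: a 440 Hz sine and an estimate scaled by 0.9.
reference = [math.sin(2 * math.pi * 440 * n / 44100) for n in range(1024)]
estimate = [0.9 * r for r in reference]
print(round(simple_sdr(reference, estimate), 2))
```

A higher value means the estimate is closer to the reference; here the error is one tenth of the reference amplitude, so the ratio comes out at roughly 20 dB.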