The AcousticBrainz Genre Dataset: Multi-Source, Multi-Level, Multi-Label, and Large-Scale

TitleThe AcousticBrainz Genre Dataset: Multi-Source, Multi-Level, Multi-Label, and Large-Scale
Publication TypeConference Paper
Year of Publication2019
Conference Name20th International Society for Music Information Retrieval Conference (ISMIR 2019)
AuthorsBogdanov, D., Porter A., Schreiber H., Urbano J., & Oramas S.
Conference Start Date04/11/2019
Conference LocationDelft, The Netherlands
AbstractThis paper introduces the AcousticBrainz Genre Dataset, a large-scale collection of hierarchical multi-label genre annotations from different metadata sources. It allows researchers to explore how the same music pieces are annotated differently by different communities following their own genre taxonomies, and how this could be addressed by genre recognition systems. Genre labels for the dataset are sourced from both expert annotations and crowds, permitting comparisons between strict hierarchies and folksonomies. Music features are available via the AcousticBrainz database. To guide research, we suggest a concrete research task and provide a baseline as well as an evaluation method. This task may serve as an example of the development and validation of automatic annotation algorithms on complementary datasets with different taxonomies and coverage. With this dataset, we hope to contribute to developments in content-based music genre recognition as well as cross-disciplinary studies on genre metadata analysis.
preprint/postprint documenthttp://hdl.handle.net/10230/41985
intranet