Percussive/harmonic sound separation by non-negative matrix factorization with smoothness/sparseness constraints

TitlePercussive/harmonic sound separation by non-negative matrix factorization with smoothness/sparseness constraints
Publication TypeJournal Article
Year of Publication2014
AuthorsCanadas-Quesada, F. J., Vera-Candeas P., Ruiz-Reyes N., Carabias-Orti J. J., & Cabanas-Molero P.
Journal TitleEURASIP Journal on Audio, Speech, and Music Processing 2014
AbstractIn this paper, unsupervised learning is used to separate percussive and harmonic sounds from monaural non-vocal polyphonic signals. Our algorithm is based on a modified non-negative matrix factorization (NMF) procedure that no labeled data is required to distinguish between percussive and harmonic bases because information from percussive and harmonic sounds is integrated into the decomposition process. NMF is performed in this process by assuming that harmonic sounds exhibit spectral sparseness (narrowband sounds) and temporal smoothness (steady sounds), whereas percussive sounds exhibit spectral smoothness (broadband sounds) and temporal sparseness (transient sounds). The evaluation is performed using several real-world excerpts from different musical genres. Comparing the developed approach to three current state-of-the art separation systems produces promising results.
Final publication10.1186/s13636-014-0026-5
intranet