What is the Effect of Audio Quality on the Robustness of MFCCs and Chroma Features?

TitleWhat is the Effect of Audio Quality on the Robustness of MFCCs and Chroma Features?
Publication TypeConference Paper
Year of Publication2014
Conference Name15th International Society for Music Information Retrieval Conference
AuthorsUrbano, J., Bogdanov D., Herrera P., Gómez E., & Serra X.
Pagination573-578
Conference Start Date27/10/2014
Conference LocationTaipei, Taiwan
AbstractMusic Information Retrieval is largely based on descriptors computed from audio signals, and in many practical applications they are to be computed on music corpora containing audio files encoded in a variety of lossy formats. Such encodings distort the original signal and therefore may affect the computation of descriptors. This raises the question of the robustness of these descriptors across various audio encodings. We examine this assumption for the case of MFCCs and chroma features. In particular, we analyze their robustness to sampling rate, codec, bitrate, frame size and music genre. Using two different audio analysis tools over a diverse collection of music tracks, we compute several statistics to quantify the robustness of the resulting descriptors, and then estimate the practical effects for a sample task like genre classification.
intranet