|Title||What/when causal expectation modelling applied to percussive audio |
|Publication Type||Journal Article |
|Year of Publication||2008 |
|Authors||Hazan, A., Brossier P., Marxer R., & Purwins H. |
|Journal Title||The Journal of the Acoustical Society of America, vol. 123, issue 5, p. 3800 |
|Abstract||A causal system to represent a stream of music into musical events, and generate further expected events, is presented. Starting from an auditory front-end which extracts low and mid-level features, an unsupervised clustering process builds and maintains a set of symbols aimed at representing musical stream events using both timbre and time descriptions. These symbols are then processed by an expectation module using a multiscale technique based on N-Grams. To characterize the ability of the system to generate an expectation that matches both ground truth and system transcription, we introduce several measures that take into account the uncertainty associated with the unsupervised encoding of the musical sequence. We compare three approaches to combine timing (when) and timbre (what) expectation.
|Full Document||http://mtg.upf.edu/files/publications/Acoustics2008_ahazan_final_small.pdf |