Note:
This bibliographic page is archived and will no longer be updated.
For an up-to-date list of publications from the Music Technology Group see the
Publications list
.
Acoustic scene classification by ensembling gradient boosting machine and convolutional neural networks
Title | Acoustic scene classification by ensembling gradient boosting machine and convolutional neural networks |
Publication Type | Conference Paper |
Year of Publication | 2017 |
Conference Name | Workshop on Detection and Classification of Acoustic Scenes and Events |
Authors | Fonseca, E. , Gong R. , Bogdanov D. , Slizovskaia O. , Gomez E. , & Serra X. |
Conference Start Date | 16/11/2017 |
Conference Location | Munich, Germany |
Abstract | This work describes our contribution to the acoustic scene classification task of the DCASE 2017 challenge. We propose a system that consists of the ensemble of two methods of different nature: a feature engineering approach, where a collection of hand-crafted features is input to a Gradient Boosting Machine, and another approach based on learning representations from data, where log-scaled mel-spectrograms are input to a Convolutional Neural Network. This CNN is designed with multiple filter shapes in the first layer. We use a simple late fusion strategy to combine both methods. We report classification accuracy of each method alone and the ensemble system on the provided cross-validation setup of TUT Acoustic Scenes 2017 dataset. The proposed system outperforms each of its component methods and improves the provided baseline system by 8.2%. |
preprint/postprint document | http://hdl.handle.net/10230/33454 |