Multi-Class Support Vector Machine Active Learning for Music Annotation

TitleMulti-Class Support Vector Machine Active Learning for Music Annotation
Publication TypeJournal Article
Year of Publication2010
AuthorsChen, G., Wang T. - G., Gong L., & Herrera P.
Journal TitleInternational Journal of Innovative Computing, Information and Control
Issue6
Pages921-930
AbstractMusic annotation is an important research topic in the multimedia area. One of the challenges in music annotation is how to reduce the human effort in labeling music files for building reliable classification models. In the past, there have been many studies on applying support vector machine active learning methods to automatic multimedia data annotation, which try to select the most informative examples for labeling manually. Most of these studies focused on selecting a single unlabeled example in each iteration process for binary classification. As a result, the model has to be retrained after each labeled example is solicited, and the user is likely to lose patience after a few rounds of labeling. In this paper, we present a novel multi-class active learning algorithm that can select multiple music examples for labeling in each iteration process. The key of the multi-sample selection for multi-class active learning is how to reduce the redundancy and avoid selecting the outliers among the selected examples such that each example provides unique information for model updating. To this end, we propose the distance diversity and set density in the support vector machine feature space as the measurement of the scatter of the selected sample set. Experiment results on two music data sets demonstrate the effectiveness of our method. Moreover, although our criterion is designed for music annotation, it can be used in a general frame work.
preprint/postprint documenthttp://hdl.handle.net/10230/41653
intranet