Extending the folksonomies of freesound.org using content-based audio analysis

Publication Type: Conference Paper
Year of Publication: 2009
Conference Name: Sound and Music Computing Conference
Authors: Martínez, E., Celma, Ò., Sordo, M., De Jong, B., & Serra, X.
Conference Start Date: 23/07/2009
Conference Location: Porto, Portugal
Keywords: collaborative tagging, content-based audio similarity, folksonomy, human assessment, nearest neighbor, sound collection

This paper presents Freesound.org, an online community where users share and browse audio files by means of tags and content-based audio similarity search.

We performed two analyses of the sound collection. The first examines how users tag the sounds, and it revealed some well-known problems of collaborative tagging systems (namely polysemy, synonymy, and the scarcity of existing annotations). Moreover, we found that more than 11% of the collection was scarcely annotated, with only one or two tags per sound, which hinders retrieval. The second analysis therefore focuses on enhancing the semantic annotations of these sounds by means of content-based audio similarity. To “autotag” the sounds, we use a k-NN classifier that selects candidate tags from those available on the most similar sounds.
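The autotagging idea can be illustrated with a minimal sketch. It is not the implementation used in the paper: the feature vectors, the Euclidean distance, the values of `k` and `min_votes`, and the function name `propose_tags` are all assumptions chosen for illustration. The sketch finds the k most similar annotated sounds and proposes the tags that enough of them share.

```python
import math
from collections import Counter


def propose_tags(query_vec, query_tags, collection, k=3, min_votes=2):
    """Propose new tags for a sound from its k nearest neighbours.

    collection is a list of (feature_vector, tags) pairs for annotated sounds.
    All names and defaults here are hypothetical, not taken from the paper.
    """
    # Rank annotated sounds by Euclidean distance to the query (assumed metric).
    neighbours = sorted(
        collection, key=lambda item: math.dist(query_vec, item[0])
    )[:k]
    # Count how many of the k neighbours carry each tag.
    votes = Counter(tag for _, tags in neighbours for tag in set(tags))
    # Keep tags shared by at least min_votes neighbours and not already present.
    return sorted(
        tag for tag, n in votes.items()
        if n >= min_votes and tag not in query_tags
    )


# Toy collection with made-up two-dimensional features.
collection = [
    ([0.10, 0.20], {"dog", "bark"}),
    ([0.15, 0.25], {"dog", "animal"}),
    ([0.20, 0.10], {"bark", "dog", "field-recording"}),
    ([0.90, 0.80], {"synth", "pad"}),
    ([0.85, 0.90], {"synth", "loop"}),
]

# A scarcely annotated sound with a single tag.
proposed = propose_tags([0.12, 0.22], {"dog"}, collection)
print(proposed)  # ['bark']
```

Requiring agreement among several neighbours is one simple way to limit noisy proposals. A real system would use audio descriptors extracted from the signal in place of the toy vectors above.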

Human assessment was used to evaluate the perceived quality of the candidate tags. The results show that, for 77% of the sounds used, the annotations were correctly extended with the proposed tags.

Published document: files/publications/SMC09_emartinez_ocelma_msordo_bdejong_xserra.pdf