Corpora for Music Information Research in Indian Art Music

Title: Corpora for Music Information Research in Indian Art Music
Publication Type: Conference Paper
Year of Publication: 2014
Conference Name: International Computer Music Conference/Sound and Music Computing Conference
Authors: Srinivasamurthy, A., Koduri, G. K., Gulati, S., Ishwar, V., & Serra, X.
Conference Start Date: 14/09/2014
Conference Location: Athens, Greece
Abstract: Research corpora are representative collections of data and are essential for developing data-driven approaches in Music Information Research (MIR). We address the problem of building research corpora for MIR in the Indian art music traditions of Hindustani and Carnatic music, considering several relevant criteria for building such corpora. We also discuss a methodology to assess the corpora against these criteria and present an evaluation of the corpora in terms of their coverage and completeness. In addition to the corpora, we briefly describe the test datasets that we have built for use in many research tasks. Specifically, we describe the tonic dataset, the Carnatic rhythm dataset, the Carnatic varnam dataset, and the Mridangam stroke dataset. The criteria and the evaluation methodology discussed in this article can be used to systematically build a representative and comprehensive research corpus. The corpora and the datasets are accessible to the research community from a central online repository.