A non-linear rhythm-based style classifcation for Broadcast Speech-Music Discrimination

Guaus, Enric; Batlle, E.

Note: This bibliographic page is archived and will no longer be updated. For an up-to-date list of publications from the Music Technology Group see the Publications list .

A non-linear rhythm-based style classifcation for Broadcast Speech-Music Discrimination

Title	A non-linear rhythm-based style classifcation for Broadcast Speech-Music Discrimination
Publication Type	Conference Paper
Year of Publication	2004
Conference Name	116th AES Convention
Authors	Guaus, E. , & Batlle E.
Conference Start Date	08/05/2004
Conference Location	Berlin, Germany
Abstract	Speech-Music discriminators are usually designed under some rigid constrains. This paper deals with a more general Speech-Music Discriminator successfully used in AIDA project. The system is based on a Hidden Markov Model style classication process in which the styles are grouped into two major categories Speech or Music. The goals of this sub-system are (1)the expandible possibilities with the addition of some new styles (like "phone female voice"), (2)the use of new rhytmical descriptors in combination with other typical ones and (3)the robustness of our speech/music discriminator in many dierent environments by using some Mathematical Morphology and non-linear post-processing techniques. The techniques used in our system allow a fast track in changes between styles and, thus, typical confusions in commercials can be easily cleaned. The accuracy of this system can be up to a 94.3% in broadcast radio environment.
preprint/postprint document	files/publications/AES116-eguaus.pdf