Seminar by Gautham Mysore on Non-negative Hidden Markov Modeling of Audio

02.10.2012

 

When and where? Thursday, Oct 4, 2012, 3:30pm, 52.321

Host: Xavier Serra (MTG)

Title: Non-negative Hidden Markov Modeling of Audio

Abstract:
Non-negative spectrogram factorization techniques have become quite popular in the last decade because they are effective at modeling the spectral structure of audio. They have been used extensively for applications such as source separation and denoising. These techniques, however, fail to account for non-stationarity and temporal dynamics, two important properties of audio. In this talk, I will introduce the non-negative hidden Markov model (N-HMM) and the non-negative factorial hidden Markov model (N-FHMM), which model single sound sources and sound mixtures, respectively. They jointly model the spectral structure and temporal dynamics of sound sources while accounting for non-stationarity. I will also discuss the use of these models in applications such as source separation, denoising, and content-based audio processing, showing why they yield improved performance compared to non-negative spectrogram factorization techniques.

Gautham J. Mysore website
