Note: This bibliographic page is archived and will no longer be updated. For an up-to-date list of publications from the Music Technology Group see the Publications list .

Convolutional neural networks for audio processing: starting pack

Title Convolutional neural networks for audio processing: starting pack
Publication Type Miscellaneous
Year of Publication 2017
Authors Miron, M. , & Slizovskaia O.
preprint/postprint document https://pydata.org/barcelona2017/schedule/presentation/37/
Full Text

Neural networks are increasingly popular in audio signal processing for topics as speech recognition or denoising. Scientific papers are usually accompanied by code repositories which rely on libraries as Theano or Tensorflow that can be interfaced from python. However, adapting a system to different tasks and data must take into account a set of pre-training routines and parameter debugging which we will discuss in this tutorial. Starting from the audio signals we introduce the data pre-training steps (feature computation, batch generation, normalization( with examples in numpy or scipy. We summarize the core concepts in neural networks and we code an architecture with the Keras library. Finally, we learn how to visualize and debug parameters with TensorBoard.

The slides can be found here .

The notebooks can be found at the github repository .