Expression Control in Singing Voice Synthesis: Features, Approaches, Evaluation, and Challenges

Umbert, M.; Bonada, J.; Goto, M.; Nakano, T.; Sundberg, J.

Note: This bibliographic page is archived and will no longer be updated. For an up-to-date list of publications from the Music Technology Group see the Publications list .

Expression Control in Singing Voice Synthesis: Features, Approaches, Evaluation, and Challenges

Title	Expression Control in Singing Voice Synthesis: Features, Approaches, Evaluation, and Challenges
Publication Type	Journal Article
Year of Publication	2015
Authors	Umbert, M. , Bonada J. , Goto M. , Nakano T. , & Sundberg J.
Journal Title	IEEE Signal Processing Magazine
Volume	32
Issue	6
Pages	55-73
Journal Date	11/2015
ISSN	1053-5888
Abstract	In the context of singing voice synthesis, expression control manipulates a set of voice features related to a particular emotion, style, or singer. Also known as performance modeling, it has been approached from different perspectives and for different purposes, and different projects have shown a wide extent of applicability. The aim of this article is to provide an overview of approaches to expression control in singing voice synthesis. Section I introduces some musical applications that use singing voice synthesis techniques to justify the need for an accurate control of expression. Then, expression is defined and related to speech and instrument performance modeling. Next, Section II presents the commonly studied set of voice parameters that can change perceptual aspects of synthesized voices. Section III provides, as the main topic of this review, an up-to-date classification, comparison, and description of a selection of approaches to expression control. Then, Section IV describes how these approaches are currently evaluated and discusses the benefits of building a common evaluation framework and adopting perceptually-motivated objective measures. Finally, Section V discusses the challenges that we currently foresee.
preprint/postprint document	http://hdl.handle.net/10230/37266
Final publication	https://doi.org/10.1109/MSP.2015.2424572

Additional material:

All cited sounds have been collected here .