Esophageal Voice Enhancement by Modeling Radiated Pulses in Frequency Domain

Loscos, A.; Bonada, J.

Note: This bibliographic page is archived and will no longer be updated. For an up-to-date list of publications from the Music Technology Group, see the Publications list.

Title	Esophageal Voice Enhancement by Modeling Radiated Pulses in Frequency Domain
Publication Type	Conference Paper
Year of Publication	2006
Authors	Loscos, A., & Bonada, J.
Abstract	Although esophageal speech has proven to be the most popular voice recovery method after laryngectomy surgery, it is difficult to master and yields poor intelligibility. This article proposes a new method for esophageal voice enhancement using digital speech signal processing techniques based on modeling radiated voice pulses in the frequency domain. The analysis-transformation-synthesis technique creates a non-pathological spectrum for utterances classified as voiced and filters those classified as unvoiced. Generating the healthy spectrum involves transforming the original timbre, modeling harmonic phase coupling from the spectral shape envelope, and deriving pitch from frame energy analysis. The resynthesized speech aims to improve intelligibility, minimize artifacts, and resemble the patient's original pre-surgery voice.
preprint/postprint document	files/publications/9d0455-AES121-aloscos-jonada.pdf