Transforming high-effort voices into breathy voices using adaptive pre-emphasis linear prediction

Nordstrom, Karl

Transforming high-effort voices into breathy voices using adaptive pre-emphasis linear prediction

dc.contributor.author	Nordstrom, Karl
dc.contributor.supervisor	Driessen, Peter
dc.date.accessioned	2008-04-29T20:51:14Z
dc.date.available	2008-04-29T20:51:14Z
dc.date.copyright	2008	en_US
dc.date.issued	2008-04-29T20:51:14Z
dc.degree.department	Dept. of Electrical and Computer Engineering	en_US
dc.degree.level	Doctor of Philosophy Ph.D.	en_US
dc.description.abstract	During musical performance and recording, there are a variety of techniques and electronic effects available to transform the singing voice. The particular effect examined in this dissertation is breathiness, where artificial noise is added to a voice to simulate aspiration noise. The typical problem with this effect is that artificial noise does not effectively blend into voices that exhibit high vocal effort. The existing breathy effect does not reduce the perceived effort; breathy voices exhibit low effort. A typical approach to synthesizing breathiness is to separate the voice into a filter representing the vocal tract and a source representing the excitation of the vocal folds. Artificial noise is added to the source to simulate aspiration noise. The modified source is then fed through the vocal tract filter to synthesize a new voice. The resulting voice sounds like the original voice plus noise. Listening experiments were carried out. These listening experiments demonstrated that constant pre-emphasis linear prediction (LP) results in an estimated vocal tract filter that retains the perception of vocal effort. It was hypothesized that reducing the perception of vocal effort in the estimated vocal tract filter may improve the breathy effect. This dissertation presents adaptive pre-emphasis LP (APLP) as a technique to more appropriately model the spectral envelope of the voice. The APLP algorithm results in a more consistent vocal tract filter and an estimated voice source that varies more appropriately with changes in vocal effort. This dissertation describes how APLP estimates a spectral emphasis filter that can transform the spectral envelope of the voice, thereby reducing the perception of vocal effort. A listening experiment was carried out to determine whether APLP is able to transform high effort voices into breathy voices more effectively than constant pre-emphasis LP. The experiment demonstrates that APLP is able to reduce the perceived effort in the voice. In addition, the voices transformed using APLP sound less artificial than the same voices transformed using constant pre-emphasis LP. This indicates that APLP is able to more effectively transform high-effort voices into breathy voices.	en_US
dc.identifier.bibliographicCitation	K. I. Nordstrom, G. Tzanetakis and P. F. Driessen, “Transforming high-effort voices into breathy voices using adaptive pre-emphasis linear prediction“, IEEE Transactions on Audio, Speech and Language Processing (accepted for publication).	en_US
dc.identifier.bibliographicCitation	K. I. Nordstrom and P. F. Driessen, “Variable preemphasis LPC for modeling vocal effort in the singing voice“, Proceedings of the 9th International Conference on Digital Audio Effects (DAFx06), Montreal, QC, Canada, September 2006.	en_US
dc.identifier.bibliographicCitation	K. I. Nordstrom, P. F. Driessen, and G. A. Rutledge, “Influence of the LPC filter upon the perception of breathiness and vocal effort“, IEEE Int. Symposium on Signal Processing and Information Technology (ISSPIT06), Vancouver, BC, Canada, August 2006.	en_US
dc.identifier.bibliographicCitation	K. I. Nordstrom, G. A. Rutledge, P. F. Driessen, “Using voice conversion as a paradigm for analyzing breath quality“, IEEE Pacific Rim Conference on Communications, Computers and Signal Processing (PacRim05), Victoria, BC, Canada, August, 2005.	en_US
dc.identifier.uri	http://hdl.handle.net/1828/916
dc.language	English	eng
dc.language.iso	en	en_US
dc.rights	Available to the World Wide Web	en_US
dc.subject	voice transformation	en_US
dc.subject	voice modeling	en_US
dc.subject	voice	en_US
dc.subject	linear prediction	en_US
dc.subject	LPC	en_US
dc.subject	APLP	en_US
dc.subject	adaptive pre-emphasis	en_US
dc.subject	voice quality	en_US
dc.subject	vocal tract filter	en_US
dc.subject	formant filter	en_US
dc.subject	voice source	en_US
dc.subject	glottal source	en_US
dc.subject.lcsh	UVic Subject Index::Sciences and Engineering::Engineering::Electrical engineering	en_US
dc.title	Transforming high-effort voices into breathy voices using adaptive pre-emphasis linear prediction	en_US
dc.type	Thesis	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Nordstrom_dissertation_final.pdf
Size:: 3.43 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.95 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Electronic Theses and Dissertations (ETD)
Theses (Electrical and Computer Engineering)