Abstract
This paper discusses a method for real-time conversion of whispers to normally phonated speech using a code-excited linear prediction (CELP) analysis-by-synthesis codec. The approach uses a template of a speaker's normally phonated speech to extract excitation parameters such as pitch and gain, then injects these estimated excitations into the whispered signal to synthesize normal-sounding speech through the CELP codec. Because restoring pitch to whispered speech raises quality and accuracy concerns, spectral enhancement is also required: formant shifting, accomplished through line spectral pair (LSP) adjustment, and pitch injection based on a voiced/unvoiced decision. Implementing these methods with the widely used CELP codec allows the technique to be integrated with modern speech applications and devices. Subjective test results are presented to assess the effectiveness of the technique.
Original language | English |
---|---|
Title of host publication | Proceedings of APCCAS 2008: IEEE Asia Pacific Conference on Circuits and Systems, 30 November - 3 December 2008, Macao, China |
Publisher | IEEE |
Pages | 1280-1283 |
Number of pages | 4 |
ISBN (Print) | 9781424423422 |
Publication status | Published - 2008 |
Event | IEEE Asia-Pacific Conference on Circuits and Systems - Duration: 2 Dec 2012 → … |
Keywords
- speech synthesis
- whispered speech
- whispers