Spectral enhancement of whispered speech based on probability mass function

Hamid Reza Sharifzadeh, Ian Vince McLoughlin, Farzaneh Ahmadi

    Research output: Chapter in Book / Conference PaperConference Paperpeer-review

    5 Citations (Scopus)

    Abstract

    Whispered speech can be effectively used for quiet and private communications over mobile phones and is also the communication means for ENT patients under a regime of voice rest. The reconstruction of natural sounding speech from such whispers can be useful for several types of application across different scientific fields ranging from communications to biomedical engineering. Despite the useful applications for a such technology, the reconstruction of natural speech from whispers has received relatively little research effort to date. This paper presents novel methods for spectral enhancement and formant smoothing with the aim of attaining more natural sounding speech within the reconstruction process. The proposed approach uses a probability mass-density function to identify a reliable formant trajectory through whispers and apply vocal modifications accordingly. Subjective evaluation experiments were performed, and are reported, to assess the performance of the techniques. A method for the near real-time conversion of whispers to normal phonated speech through a modified CELP codec has been discussed in our previously published work which, the proposed formant modification approach in this paper builds upon.
    Original languageEnglish
    Title of host publicationProceedings of The Sixth Advanced International Conference on Telecommunications, AICT 2010, 9-15 May 2010, Barcelona, Spain
    PublisherIEEE
    Pages207-211
    Number of pages5
    ISBN (Print)9780769540214
    DOIs
    Publication statusPublished - 2010
    EventAdvanced International Conference on Telecommunications -
    Duration: 9 May 2010 → …

    Conference

    ConferenceAdvanced International Conference on Telecommunications
    Period9/05/10 → …

    Keywords

    • linear predictive coding
    • spectral enhancement
    • speech synthesis
    • whispered speech
    • whispers

    Fingerprint

    Dive into the research topics of 'Spectral enhancement of whispered speech based on probability mass function'. Together they form a unique fingerprint.

    Cite this