Skip to main navigation Skip to search Skip to main content

Acoustic and visual adaptations in speech produced to counter adverse listening conditions

Research output: Chapter in Book / Conference PaperConference Paperpeer-review

5 Citations (Scopus)

Abstract

This study investigated whether communication modality affects talkers' speech adaptation to an interlocutor exposed to background noise. It was predicted that adaptations to lip gestures would be greater and acoustic ones reduced when communicating face-to-face. We video recorded 14 Australian-English talkers (Talker A) speaking in a face-toface or auditory only setting with their interlocutors who were either in quiet or noise. Focusing on keyword productions, acoustic-phonetic adaptations were examined via measures of vowel intensity, pitch, keyword duration, vowel F1/F2 space and VOT, and visual adaptations via measures of vowel interlip area. The interlocutor adverse listening conditions lead Talker A to reduce speech rate, increase pitch and expand vowel space. These adaptations were not significantly reduced in the face-to-face setting although there was a trend for a smaller degree of vowel space expansion than in the auditory only setting. Visible lip gestures were more enhanced overall in the face-to-face setting, but also increased in the auditory only setting when countering the effects of noise. This study therefore showed only small effects of communication modality on speech adaptations.
Original languageEnglish
Title of host publicationProceedings of the 12th International Conference on Auditory-Visual Speech Processing, 29 August - 1 September 2013, Annecy, France
PublisherInria
Pages93-98
Number of pages6
Publication statusPublished - 2013
EventInternational Conference on Auditory-Visual Speech Processing -
Duration: 29 Aug 2013 → …

Publication series

Name
ISSN (Print)2308-975X

Conference

ConferenceInternational Conference on Auditory-Visual Speech Processing
Period29/08/13 → …

Fingerprint

Dive into the research topics of 'Acoustic and visual adaptations in speech produced to counter adverse listening conditions'. Together they form a unique fingerprint.

Cite this