Abstract
Listening to speech in competing background speech can be difficult due to elements such as the linguistic content of the signal. Linguistic release from masking occurs when altering the masker language results in less interference in speech recognition. The greater the linguistic differences between the target and masker, the higher the speech recognition accuracy. However, for dialectal variations of the same language, these patterns are less consistent. This study examined speech-in-speech recognition in Australian English monolinguals when the target speech was in either Australian (AU) or American English, and when the masker speech was in either of the dialects or a foreign language (Swedish). Speech recognition performance was greatest when AU was the target and poorest when AU was the masker. There were fewer differences in performance between the Swedish and dialect maskers. Results indicate that speech recognition is modulated by a listener's familiarity to a dialect.
Original language | English |
---|---|
Title of host publication | Proceedings of INTERSPEECH 2023, Dublin, Ireland, 20-24 August 2023 |
Subtitle of host publication | Inclusive Spoken Language Science and Technology - Breaking Down Barriers |
Editors | Naomi Harte, Julie Carson-Berndsen, Gareth Jones |
Place of Publication | Dublin, Ireland |
Pages | 3113-3116 |
Number of pages | 4 |
DOIs | |
Publication status | Published - 2023 |
Event | 24th International Speech Communication Association, Interspeech 2023 - Dublin, Ireland Duration: 20 Aug 2023 → 24 Aug 2023 |
Publication series
Name | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH |
---|---|
ISSN (Print) | 2308-457X |
Conference
Conference | 24th International Speech Communication Association, Interspeech 2023 |
---|---|
Country/Territory | Ireland |
City | Dublin |
Period | 20/08/23 → 24/08/23 |
Bibliographical note
Publisher Copyright:© 2023 International Speech Communication Association. All rights reserved.
Keywords
- dialect variation
- familiarity to dialect
- foreign language
- speech-in-speech recognition
- target-masker similarity