Abstract
Cochlea-scaled entropy (CSE) was proposed as a signal-based metric for automatic detection of speech regions most important for intelligibility, but its proposed superiority over traditional linguistic and psychoacoustical characterisations was not subsequently confirmed. This paper shows that the CSE concept is closely related to intensity and as such captures similar speech regions. However, a slight but significant advantage of a CSE over an intensity-based characterisation was observed, associated with a time difference between the two metrics, suggesting that the CSE index may capture dynamical properties of the speech signal crucial for intelligibility.
Original language | English |
---|---|
Pages (from-to) | EL443-EL448 |
Number of pages | 6 |
Journal | Journal of the Acoustical Society of America |
Volume | 143 |
Issue number | 6 |
DOIs | |
Publication status | Published - 2018 |
Keywords
- cochlea
- signal processing
- speech perceptiopn
- speech synthesis