Abstract
Ultrasonic speech is a novel technology which implies exciting human vocal tract (VT) with an ultrasonic signal to provide a speech mode in the ultrasonic frequency range. This has several applications including speech-aid prostheses for voice-loss patients, silent speech interfaces, secure mode of communication in mobile phones and speech therapy. Linear prediction has recently been proven to be applicable for feature extraction of ultrasonic propagation inside the VT. The authors have proposed that averaging the predictor coefficients obtained from multiple receiving points is a viable approach for autoregressive (AR) modelling of ultrasonic speech. In support of the previous theoretical work, this paper presents experimental results of implementing the averaging method, using finite element analysis of ultrasonic propagation inside the VT configuration for nine English vowels. A comparison of the results with the conventional method of least squares error (LSE) - used in room acoustics - shows that averaging outperforms LSE in terms of determining the location of poles in the AR modelling of ultrasonic speech and demonstrates higher robustness to variations of the LPC order.
Original language | English |
---|---|
Title of host publication | Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010: Spoken Language Processing for All: Makuhari, Chiba, Japan, 26-30 September 2010 |
Publisher | International Speech Communication Association |
Pages | 1616-1619 |
Number of pages | 4 |
ISBN (Print) | 9781617821233 |
Publication status | Published - 2010 |
Event | International Speech Communication Association. Conference - Duration: 9 Sept 2012 → … |
Conference
Conference | International Speech Communication Association. Conference |
---|---|
Period | 9/09/12 → … |
Keywords
- autoregressive models
- least squares
- linear prediction analysis
- speech perception
- ultrasonic speech