High-resolution dynamic speech imaging with joint low-rank and sparsity constraints

Maojing Fu, Bo Zhao, Christopher Carignan, Ryan K. Shosted, Jamie L. Perry, David P. Kuehn, Zhi-Pei Liang, Bradley P. Sutton

    Research output: Contribution to journalArticlepeer-review

    66 Citations (Scopus)

    Abstract

    Purpose: To enable dynamic speech imaging with high spatiotemporal resolution and full-vocal-tract spatial coverage, leveraging recent advances in sparse sampling. Methods: An imaging method is developed to enable high-speed dynamic speech imaging exploiting low-rank and sparsity of the dynamic images of articulatory motion during speech. The proposed method includes: (a) a novel data acquisition strategy that collects spiral navigators with high temporal frame rate and (b) an image reconstruction method that derives temporal subspaces from navigators and reconstructs high-resolution images from sparsely sampled data with joint low-rank and sparsity constraints. Results: The proposed method has been systematically evaluated and validated through several dynamic speech experiments. A nominal imaging speed of 102 frames per second (fps) was achieved for a single-slice imaging protocol with a spatial resolution of 2.2 × 2.2 × 6.5 mm3. An eight-slice imaging protocol covering the entire vocal tract achieved a nominal imaging speed of 12.8 fps with the identical spatial resolution. The effectiveness of the proposed method and its practical utility was also demonstrated in a phonetic investigation. Conclusion High spatiotemporal resolution with full-vocal-tract spatial coverage can be achieved for dynamic speech imaging experiments with low-rank and sparsity constraints.
    Original languageEnglish
    Pages (from-to)1820-1832
    Number of pages13
    JournalMagnetic Resonance in Medicine
    Volume73
    Issue number5
    DOIs
    Publication statusPublished - 2015

    Keywords

    • magnetic resonance imaging
    • speech

    Fingerprint

    Dive into the research topics of 'High-resolution dynamic speech imaging with joint low-rank and sparsity constraints'. Together they form a unique fingerprint.

    Cite this