Vocal Tract Articulatory Contour Detection in Real-Time Magnetic Resonance Images Using Spatio-Temporal Context

Date

2020

Authors

Hebbar S.A.
Sharma R.
Somandepalli K.
Toutios A.
Narayanan S.

Abstract

Because it can visualize and measure the dynamics of vocal tract shaping during speech production, real-time magnetic resonance imaging (rtMRI) has emerged as a prominent research tool. Tracking individual articulators such as the tongue, lips, velum, and pharynx is a crucial step toward automating further scientific and clinical analysis. Several researchers have recently addressed the problem of detecting articulatory boundaries, but existing approaches are primarily limited to static-image-based methods. In this work, we propose using information from temporal dynamics together with spatial structure to detect the articulatory boundaries in rtMRI videos. We train a convolutional LSTM network to detect and label the articulatory contours. We compare the produced contours against reference labels generated by iteratively fitting a manually created subject-specific template. We observe that the proposed method outperforms solely image-based methods, especially for the difficult-to-track articulators involved in airway constriction formation during speech. © 2020 IEEE.

Citation

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Vol. 2020-May, pp. 7354-7358
