Seeing Speech: Magnetic Resonance Imaging-Based Vocal Tract Deformation Visualization Using Cross-Modal Transformer.
Kele Xu, Ming Feng, Weiquan Huang
Published in: ACM Multimedia (2022)
Keyphrases
- magnetic resonance imaging
- vocal tract
- cross modal
- speech signal
- speech synthesis
- multi modal
- mri data
- medical images
- speech recognition
- image acquisition
- medical imaging
- speech sounds
- multimedia retrieval
- image registration
- image retrieval
- text to speech
- data analysis
- automatic speech recognition
- linear prediction
- multimedia databases
- hidden markov models
- neural network
- visual similarity
- object recognition
- audio visual
- document retrieval
- non stationary
- language model
- pattern recognition
- multiscale