Tal: A Synchronised Multi-Speaker Corpus of Ultrasound Tongue Imaging, Audio, and Lip Videos.
Manuel Sam RibeiroJennifer SangerJing-Xuan ZhangAciel EshkyAlan WrenchKorin RichmondSteve RenalsPublished in: SLT (2021)
Keyphrases
- ultrasound images
- ultrasound imaging
- audio visual
- visual speech
- audio visual speech recognition
- prosodic features
- visual data
- speaker identification
- audio features
- multimedia
- spontaneous speech
- vocal tract
- image analysis
- computer aided
- video sequences
- automatic transcription
- high resolution
- video indexing and retrieval
- visual information
- radio frequency
- video content
- video data
- video analysis
- imaging systems
- computer vision
- video material
- speaker verification
- audio stream
- key frames
- medical imaging
- video signals
- video clips
- motion analysis
- human activities
- audio signals
- hidden markov models
- lecture videos
- speech recognition
- video frames