Login / Signup
Detecting Mismatch Between Speech and Transcription Using Cross-Modal Attention.
Qiang Huang
Thomas Hain
Published in:
INTERSPEECH (2019)
Keyphrases
</>
cross modal
multi modal
automatic transcription
multimedia retrieval
speech recognition
multimedia databases
image retrieval
visual recognition
visual data
visual similarity
perceptual information
information retrieval
multimedia