Login / Signup
MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition.
Yuchen Hu
Chen Chen
Ruizhe Li
Heqing Zou
Eng Siong Chng
Published in:
ACL (1) (2023)
Keyphrases
</>
invariant representations
audio visual speech recognition
image processing
multi modal
invariant representation