MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition.
Yuchen Hu
Chen Chen
Ruizhe Li
Heqing Zou
Eng Siong Chng
Published in:
CoRR (2023)
Keyphrases
invariant representations
audio visual speech recognition
multi modal
keywords
three dimensional
object recognition
mathematical morphology
multi stream