Sign in

MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition.

Yuchen HuChen ChenRuizhe LiHeqing ZouEng Siong Chng
Published in: ACL (1) (2023)
Keyphrases
  • invariant representations
  • audio visual speech recognition
  • image processing
  • multi modal
  • invariant representation