Login / Signup

MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition.

Yuchen HuChen ChenRuizhe LiHeqing ZouEng Siong Chng
Published in: CoRR (2023)
Keyphrases
  • invariant representations
  • audio visual speech recognition
  • multi modal
  • keywords
  • three dimensional
  • object recognition
  • mathematical morphology
  • multi stream