Login / Signup

LCB-Net: Long-Context Biasing for Audio-Visual Speech Recognition.

Fan YuHaoxu WangXian ShiShiliang Zhang
Published in: ICASSP (2024)
Keyphrases
  • audio visual speech recognition
  • information retrieval systems
  • contextual information
  • multiscale
  • image quality
  • gaussian mixture model