LCB-net: Long-Context Biasing for Audio-Visual Speech Recognition.

Fan Yu, Haoxu Wang, Xian Shi, Shiliang Zhang
Published in: CoRR (2024)
Keyphrases
  • audio visual speech recognition
  • multi stream