Login / Signup
Cross-Modal Language Modeling in Multi-Motion-Informed Context for Lip Reading.
Xi Ai
Bin Fang
Published in:
IEEE ACM Trans. Audio Speech Lang. Process. (2023)
Keyphrases
</>
language modeling
cross modal
language model
retrieval model
information retrieval
multi modal
probabilistic model
query expansion
visual data
image sequences
n gram
contextual information
space time
document retrieval
information retrieval systems
text classification