Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation.
Qiushi Zhu
Jie Zhang
Yu Gu
Yuchen Hu
Lirong Dai
Published in:
AAAI (2024)
Keyphrases
multi modal
audio visual
uni modal
cross modal
machine learning
active learning
visual recognition
video search
high dimensional
image registration
visual information
semantic concepts
multi modality
multiple modalities