Login / Signup
Audio-Visual Scene-Aware Dialog and Reasoning using Audio-Visual Transformers with Joint Student-Teacher Learning.
Ankit P. Shah
Shijie Geng
Peng Gao
Anoop Cherian
Takaaki Hori
Tim K. Marks
Jonathan Le Roux
Chiori Hori
Published in:
CoRR (2021)
Keyphrases
</>
audio visual
visual data
multi modal
visual information
multimedia
learning process
multi stream
machine learning
e learning
training set
online learning
learning strategies