Audio-Visual Scene-Aware Dialog and Reasoning Using Audio-Visual Transformers with Joint Student-Teacher Learning.
Ankit P. Shah
Shijie Geng
Peng Gao
Anoop Cherian
Takaaki Hori
Tim K. Marks
Jonathan Le Roux
Chiori Hori
Published in:
ICASSP (2022)
Keyphrases
audio visual
visual data
multi modal
visual information
multi stream
multimedia
learning process
online learning
active learning
machine learning
high level
image sequences
student teachers