Login / Signup
Finding Fallen Objects Via Asynchronous Audio-Visual Integration.
Chuang Gan
Yi Gu
Siyuan Zhou
Jeremy Schwartz
Seth Alter
James Traer
Dan Gutfreund
Joshua B. Tenenbaum
Josh H. McDermott
Antonio Torralba
Published in:
CoRR (2022)
Keyphrases
</>
audio visual
visual data
multi modal
multi stream
emotion recognition
multimedia
temporal context
d objects
visual information
audio visual speech recognition
person authentication
visual features
human computer interaction
data sets
nearest neighbor
moving objects
multiscale
image sequences
high level
databases