Login / Signup
Hearing Touch: Audio-Visual Pretraining for Contact-Rich Manipulation.
Jared Mejia
Victoria Dean
Tess Lee Hellebrekers
Abhinav Gupta
Published in:
CoRR (2024)
Keyphrases
</>
audio visual
multi modal
visual information
video summarization
multimedia
visual data
multi stream
temporal context
emotion recognition
audio visual speech recognition
person authentication
computer vision
nearest neighbor
pattern recognition
video sequences
high level
data sets