Login / Signup
Self-supervised video pretraining yields robust and more human-aligned visual representations.
Nikhil Parthasarathy
S. M. Ali Eslami
João Carreira
Olivier J. Hénaff
Published in:
NeurIPS (2023)
Keyphrases
</>
domain specific
visual representations
video sequences
video data
multimedia
video content
video frames
visualization tools
video retrieval
video analysis
visual representation
data mining
learning process
d objects
object detection
visual analysis