VicTR: Video-conditioned Text Representations for Activity Recognition.

Kumara Kahatapitiya, Anurag Arnab, Arsha Nagrani, Michael S. Ryoo
Published in: CoRR (2023)
Keyphrases