Login / Signup

Can VLMs be used on videos for action recognition? LLMs are Visual Reasoning Coordinators.

Harsh Lunia
Published in: CoRR (2024)
Keyphrases