V2Xum-LLM: Cross-Modal Video Summarization with Temporal Prompt Instruction Tuning.

Hang Hua, Yunlong Tang, Chenliang Xu, Jiebo Luo
Published in: CoRR (2024)
Keyphrases