VLAP: Efficient Video-Language Alignment via Frame Prompting and Distilling for Video Question Answering.

Published in: CoRR (2023)

Keyphrases