Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models.
Muhammad Maaz
Hanoona Abdul Rasheed
Salman Khan
Fahad Khan
Published in:
ACL (1) (2024)
Keyphrases
language model
video sequences
video data
speech recognition
n gram
multimedia
video frames
language modeling
query expansion
key frames
language modelling
information retrieval
video content
video retrieval
video shots
statistical language models
document retrieval