Login / Signup
Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models.
Muhammad Maaz
Hanoona Abdul Rasheed
Salman H. Khan
Fahad Shahbaz Khan
Published in:
CoRR (2023)
Keyphrases
</>
language model
video content
video sequences
video data
video frames
probabilistic model
information retrieval
language modeling
speech recognition
vector space model
video retrieval
smoothing methods