Sign in

Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models.

Muhammad MaazHanoona Abdul RasheedSalman H. KhanFahad Shahbaz Khan
Published in: CoRR (2023)
Keyphrases
  • language model
  • video content
  • video sequences
  • video data
  • video frames
  • probabilistic model
  • information retrieval
  • language modeling
  • speech recognition
  • vector space model
  • video retrieval
  • smoothing methods