Login / Signup
MM-VID: Advancing Video Understanding with GPT-4V(ision).
Kevin Lin
Faisal Ahmed
Linjie Li
Chung-Ching Lin
Ehsan Azarnasab
Zhengyuan Yang
Jianfeng Wang
Lin Liang
Zicheng Liu
Yumao Lu
Ce Liu
Lijuan Wang
Published in:
CoRR (2023)
Keyphrases
</>
video data
video streams
real time
video content
real time video
multimedia
video frames
data sets
video retrieval
video sequences
video analysis
video processing
deeper understanding
video segments
space time
multi view
event detection
key frames
medical images
spatio temporal
quality metrics
visual analysis