InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding.
Yi WangKunchang LiXinhao LiJiashuo YuYinan HeGuo ChenBaoqi PeiRongkun ZhengJilan XuZun WangYansong ShiTianxiang JiangSongze LiHongjie ZhangYifei HuangYu QiaoYali WangLimin WangPublished in: CoRR (2024)
Keyphrases
- video sequences
- video data
- video streams
- multimedia
- video content
- video database
- video analysis
- video images
- video frames
- real time
- real time video
- special effects
- computational modeling
- key frames
- parameter estimation
- video surveillance
- probabilistic model
- prior knowledge
- visual cues
- video processing
- artificial neural networks
- event recognition
- learning algorithm
- information retrieval
- neural network