Long-Form Video-Language Pre-Training with Multimodal Temporal Contrastive Learning.
Yuchong SunHongwei XueRuihua SongBei LiuHuan YangJianlong FuPublished in: CoRR (2022)
Keyphrases
- learning process
- learning systems
- learning algorithm
- active learning
- online learning
- spatial and temporal
- video sequences
- supervised learning
- temporal information
- interactive video
- language acquisition
- feedforward neural networks
- inductive inference
- multimedia
- temporal data
- training process
- learning tasks
- temporal analysis