Understanding Chinese Video and Language via Contrastive Multimodal Pre-Training.
Chenyi LeiShixian LuoYong LiuWanggui HeJiamang WangGuoxin WangHaihong TangChunyan MiaoHouqiang LiPublished in: ACM Multimedia (2021)
Keyphrases
- story segmentation
- multimedia
- video data
- video sequences
- chinese language
- broadcast news
- video streams
- video content
- training set
- programming language
- multi modal
- language learning
- training process
- video retrieval
- training phase
- video database
- news video
- neural network
- chinese text
- video analysis
- video clips
- training samples
- supervised learning
- chinese characters
- conceptual models
- spatial and temporal
- human computer interaction
- learning algorithm