Towards Holistic Language-video Representation: the language model-enhanced MSR-Video to Text Dataset.
Yuchen Yang, Yingxuan Duan
Published in: CoRR (2024)
Keyphrases
- language model
- video representation
- spatio temporal
- space time
- information retrieval
- video analysis
- video streams
- video database
- video content
- generative model
- probabilistic model
- n gram
- dynamic textures
- human actions
- key frames
- document retrieval
- video processing
- query expansion
- retrieval model
- test collection
- video data
- text retrieval
- action recognition
- video objects
- mixture model
- text mining
- translation model
- video frames
- video sequences
- spatial and temporal
- video retrieval
- text documents
- semantic information
- multimedia
- natural language
- machine learning
- event detection
- semi supervised