MSVD-Indonesian: A Benchmark for Multimodal Video-Text Tasks in Indonesian.
Willy Fitra HendriaPublished in: CoRR (2023)
Keyphrases
- multimedia
- multiple modalities
- geographic information retrieval
- video data
- information retrieval
- video sequences
- natural language descriptions
- video search
- news video
- text mining
- machine translation
- video frames
- video streams
- text detection
- video retrieval
- video content
- web documents
- multi modal
- database
- real time
- key frames
- story segmentation
- free text
- natural language processing systems
- multimedia documents
- real world
- video collections
- closed captions
- video images
- video database
- visual data
- text retrieval
- moving objects
- keywords
- image sequences