Overcoming Weak Visual-Textual Alignment for Video Moment Retrieval.
Minjoon JungYouwon JangSeongho ChoiJoochan KimJin-Hwa KimByoung-Tak ZhangPublished in: CoRR (2023)
Keyphrases
- content based video retrieval
- video search
- news video
- multimedia
- content based indexing
- video data
- video indexing
- video indexing and retrieval
- visual features
- visual content
- video content
- visual data
- video retrieval
- lifelog
- multimedia information
- multimedia data
- textual descriptions
- content based retrieval
- semantic video
- visual concepts
- information retrieval
- visual representations
- visual information
- video sequences
- video streams
- semantic content
- visual cues
- real time
- visual analysis
- visual and textual information
- cut detection
- cross modal
- video database
- multimedia documents
- key frames
- concept detectors
- content based video
- medical image retrieval
- low level
- multimedia databases
- textual query
- video collections
- keywords
- video clips
- video frames
- multimedia search
- video dataset
- audio visual content
- visual and textual features
- video shots
- video segmentation
- video analysis
- video surveillance
- retrieval systems
- information retrieval systems
- image retrieval
- user generated
- web images
- textual information
- high level
- image database
- multimedia content
- search engine
- metadata
- natural language
- semantic concept detection
- visual similarity
- content description
- visual representation