Language-Guided Self-Supervised Video Summarization Using Text Semantic Matching Considering the Diversity of the Video.
Tomoya Sugihara, Shuntaro Masuda, Ling Xiao, Toshihiko Yamasaki
Published in: CoRR (2024)
Keyphrases
- video summarization
- semantic matching
- video content
- video browsing
- video data
- audio visual
- key frames
- video summaries
- surveillance videos
- video sequences
- ontology alignment
- event detection
- video retrieval
- video frames
- video segments
- multi modal
- natural language
- text mining
- information retrieval
- database
- web services
- low level features
- web service composition
- computer vision
- video clips
- video database
- high level
- video analysis
- service composition
- keywords
- video streams
- semantic information
- ontology matching
- visual features
- real time