Video Referring Expression Comprehension via Transformer with Content-conditioned Query.
Ji Jiang, Meng Cao, Tengtao Song, Long Chen, Yi Wang, Yuexian Zou
Published in: CoRR (2023)
Keyphrases
- video segments
- content based search
- multimedia documents
- multimedia
- online video
- query processing
- multimedia data
- database
- textual descriptions
- video sequences
- metadata
- video frames
- data structure
- video content
- response time
- video clips
- user queries
- retrieval systems
- video retrieval
- query evaluation
- relevance feedback
- semantic content
- video search
- event detection
- user interaction
- lecture videos
- video footage
- web queries
- content based video retrieval
- semantic meaning
- concept detectors
- video material
- user generated
- visual concepts
- video analysis
- visual information
- video data
- fuzzy logic
- neural network
- keyword search
- multimedia content
- query terms
- tv programs
- query expansion
- real time