Omnipotent Distillation with LLMs for Weakly-Supervised Natural Language Video Localization: When Divergence Meets Consistency.
Peijun BaoZihao ShaoWenhan YangBoon Poh NgMeng Hwa ErAlex C. KotPublished in: AAAI (2024)
Keyphrases
- weakly supervised
- object localization
- natural language
- weakly labeled
- object class
- semi supervised
- topic models
- relation extraction
- video sequences
- superpixels
- video data
- question answering
- automatic extraction
- object detectors
- video frames
- object detection
- knowledge representation
- training data
- object categories
- object classes
- natural language processing
- information extraction