Login / Signup
Weakly Supervised Gaussian Contrastive Grounding with Large Multimodal Models for Video Question Answering.
Haibo Wang
Chenghang Lai
Yixuan Sun
Weifeng Ge
Published in:
CoRR (2024)
Keyphrases
</>
expert systems
question answering
weakly supervised
relation extraction
named entities
information extraction
natural language processing
automatic extraction
probabilistic model
information retrieval
video sequences
natural language
semi supervised
object detection
object detectors
multi modal
computer vision