Login / Signup

Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!

Zaid KhanVijay Kumar B. GSamuel SchulterXiang YuYun FuManmohan Chandraker
Published in: CoRR (2023)
Keyphrases
  • language model
  • data points
  • data sources
  • computer vision
  • decision trees
  • probabilistic model
  • document retrieval
  • machine learning
  • information retrieval
  • training data
  • keywords