Sign in

ALLaVA: Harnessing GPT4V-synthesized Data for A Lite Vision-Language Model.

Guiming Hardy ChenShunian ChenRuifei ZhangJunying ChenXiangbo WuZhiyi ZhangZhihong ChenJianquan LiXiang WanBenyou Wang
Published in: CoRR (2024)
Keyphrases
  • language model
  • information retrieval
  • speech recognition
  • n gram
  • document retrieval
  • search engine
  • text categorization