LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models.

Hao Zhang, Hongyang Li, Feng Li, Tianhe Ren, Xueyan Zou, Shilong Liu, Shijia Huang, Jianfeng Gao, Lei Zhang, Chunyuan Li, Jianwei Yang
Published in: CoRR (2023)
Keyphrases
  • prior knowledge
  • database
  • email
  • real time
  • computer vision
  • model selection
  • experimental data
  • statistical models