Login / Signup

Detect2Interact: Localizing Object Key Field in Visual Question Answering (VQA) with LLMs.

Jialou WangManli ZhuYulei LiHonglei LiLongzhi YangWai Lok Woo
Published in: CoRR (2024)
Keyphrases