Surgical-LVLM: Learning to Adapt Large Vision-Language Model for Grounded Visual Question Answering in Robotic Surgery.
Guankun WangLong BaiWan Jun NahJie WangZhaoxi ZhangZhen ChenJinlin WuMobarakol IslamHongbin LiuHongliang RenPublished in: CoRR (2024)