Sign in

Enhancing Multimodal Large Language Models with Vision Detection Models: An Empirical Study.

Qirui JiaoDaoyuan ChenYilun HuangYaliang LiYing Shen
Published in: CoRR (2024)
Keyphrases