VisionGPT-3D: A Generalized Multimodal Agent for Enhanced 3D Vision Understanding.

Published in: CoRR (2024)
