Publication: What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Large Language Models.