Login / Signup

What do MLLMs hear? Examining reasoning with text and sound components in Multimodal Large Language Models.

Enis Berk ÇobanMichael I. MandelJohanna Devaney
Published in: CoRR (2024)
Keyphrases