Login / Signup

TMT: Tri-Modal Translation between Speech, Image, and Text by Processing Different Modalities as Different Languages.

Minsu KimJee-weon JungHyeongseop RhaSoumi MaitiSiddhant AroraXuankai ChangShinji WatanabeYong Man Ro
Published in: CoRR (2024)
Keyphrases