Login / Signup

Discrete Multimodal Transformers with a Pretrained Large Language Model for Mixed-Supervision Speech Processing.

Viet Anh TrinhRosy SouthwellYiwen GuanXinlu HeZhiyong WangJacob Whitehill
Published in: CoRR (2024)
Keyphrases