Login / Signup

AVicuna: Audio-Visual LLM with Interleaver and Context-Boundary Alignment for Temporal Referential Dialogue.

Yunlong TangDaiki ShimadaJing BiChenliang Xu
Published in: CoRR (2024)
Keyphrases