Login / Signup

TokenPacker: Efficient Visual Projector for Multimodal LLM.

Wentong LiYuqian YuanJian LiuDongqi TangSong WangJianke ZhuLei Zhang
Published in: CoRR (2024)
Keyphrases
  • cost effective
  • multi modal
  • computationally expensive
  • visual cues
  • neural network
  • multimedia
  • hidden markov models
  • multimodal information