Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization.
Yang Jin
Kun Xu
Kun Xu
Liwei Chen
Chao Liao
Jianchao Tan
Quzhe Huang
Bin Chen
Chengru Song
Dai Meng
Di Zhang
Wenwu Ou
Kun Gai
Yadong Mu
Published in:
ICLR (2024)
Keyphrases
visual perception
visual processing
dynamic environments
visual field
programming language
character n grams
video sequences
human vision
visual information
vision system
language learning
natural language
computer vision
information retrieval
probabilistic model
object recognition
search engine