VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset.
Sihan Chen
Xingjian He
Longteng Guo
Xinxin Zhu
Weining Wang
Jinhui Tang
Jing Liu
Published in:
CoRR (2023)
Keyphrases
computer vision
probabilistic model
computational model
statistical model
high level
formal model
probability distribution
management system
real time
mathematical model
conceptual model
definition language
specification language
language learning
em algorithm
cost function
prior knowledge
similarity measure