Single-Stream Multi-Level Alignment for Vision-Language Pretraining.
Zaid Khan
Vijay Kumar B. G
Xiang Yu
Samuel Schulter
Manmohan Chandraker
Yun Fu
Published in: CoRR (2022)
Keyphrases
real time
image processing
natural language
programming language
language learning
computer vision
data streams
vision system
sliding window
database
data mining
information retrieval
hidden markov models
object oriented
text mining
dynamic time warping