An Empirical Study of Training End-to-End Vision-and-Language Transformers.
Zi-Yi Dou
Yichong Xu
Zhe Gan
Jianfeng Wang
Shuohang Wang
Lijuan Wang
Chenguang Zhu
Pengchuan Zhang
Lu Yuan
Nanyun Peng
Zicheng Liu
Michael Zeng
Published in:
CoRR (2021)
Keyphrases
end to end
congestion control
admission control
ad hoc networks
high bandwidth
computer vision
multipath
real time
scalable video
sensor networks
application layer
text localization and recognition
rate adaptation
internet protocol
rate allocation
content delivery
image sequences
neural network