mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections.
Chenliang Li, Haiyang Xu, Junfeng Tian, Wei Wang, Ming Yan, Bin Bi, Jiabo Ye, He Chen, Guohai Xu, Zheng Cao, Ji Zhang, Songfang Huang, Fei Huang, Jingren Zhou, Luo Si
Published in: EMNLP (2022)