​
Login / Signup
Xiaowei Hu
Publication Activity (10 Years)
Years Active: 2020-2022
Publications (10 Years): 25
Top Topics
Language Model
External Knowledge
Visual Representations
Low Level Image Processing
Top Venues
CoRR
CVPR
AAAI
NeurIPS
</>
Publications
</>
Chenfei Wu
,
Jian Liang
,
Xiaowei Hu
,
Zhe Gan
,
Jianfeng Wang
,
Lijuan Wang
,
Zicheng Liu
,
Yuejian Fang
,
Nan Duan
NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis.
CoRR
(2022)
Sheng Shen
,
Chunyuan Li
,
Xiaowei Hu
,
Yujia Xie
,
Jianwei Yang
,
Pengchuan Zhang
,
Anna Rohrbach
,
Zhe Gan
,
Lijuan Wang
,
Lu Yuan
,
Ce Liu
,
Kurt Keutzer
,
Trevor Darrell
,
Jianfeng Gao
K-LITE: Learning Transferable Visual Models with External Knowledge.
CoRR
(2022)
Haotian Zhang
,
Pengchuan Zhang
,
Xiaowei Hu
,
Yen-Chun Chen
,
Liunian Harold Li
,
Xiyang Dai
,
Lijuan Wang
,
Lu Yuan
,
Jenq-Neng Hwang
,
Jianfeng Gao
GLIPv2: Unifying Localization and Vision-Language Understanding.
CoRR
(2022)
Zhiyuan Fang
,
Jianfeng Wang
,
Xiaowei Hu
,
Lin Liang
,
Zhe Gan
,
Lijuan Wang
,
Yezhou Yang
,
Zicheng Liu
Injecting Semantic Concepts into End-to-End Image Captioning.
CVPR
(2022)
Zhengyuan Yang
,
Zhe Gan
,
Jianfeng Wang
,
Xiaowei Hu
,
Faisal Ahmed
,
Zicheng Liu
,
Yumao Lu
,
Lijuan Wang
UniTAB: Unifying Text and Box Outputs for Grounded Vision-Language Modeling.
ECCV (36)
(2022)
Zhengyuan Yang
,
Zhe Gan
,
Jianfeng Wang
,
Xiaowei Hu
,
Yumao Lu
,
Zicheng Liu
,
Lijuan Wang
An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA.
AAAI
(2022)
Jianfeng Wang
,
Zhengyuan Yang
,
Xiaowei Hu
,
Linjie Li
,
Kevin Lin
,
Zhe Gan
,
Zicheng Liu
,
Ce Liu
,
Lijuan Wang
GIT: A Generative Image-to-text Transformer for Vision and Language.
Trans. Mach. Learn. Res.
2022 (2022)
Haotian Zhang
,
Pengchuan Zhang
,
Xiaowei Hu
,
Yen-Chun Chen
,
Liunian Harold Li
,
Xiyang Dai
,
Lijuan Wang
,
Lu Yuan
,
Jenq-Neng Hwang
,
Jianfeng Gao
GLIPv2: Unifying Localization and Vision-Language Understanding.
NeurIPS
(2022)
Sheng Shen
,
Chunyuan Li
,
Xiaowei Hu
,
Yujia Xie
,
Jianwei Yang
,
Pengchuan Zhang
,
Zhe Gan
,
Lijuan Wang
,
Lu Yuan
,
Ce Liu
,
Kurt Keutzer
,
Trevor Darrell
,
Anna Rohrbach
,
Jianfeng Gao
K-LITE: Learning Transferable Visual Models with External Knowledge.
NeurIPS
(2022)
Jianfeng Wang
,
Zhengyuan Yang
,
Xiaowei Hu
,
Linjie Li
,
Kevin Lin
,
Zhe Gan
,
Zicheng Liu
,
Ce Liu
,
Lijuan Wang
GIT: A Generative Image-to-text Transformer for Vision and Language.
CoRR
(2022)
Xiaowei Hu
,
Zhe Gan
,
Jianfeng Wang
,
Zhengyuan Yang
,
Zicheng Liu
,
Yumao Lu
,
Lijuan Wang
Scaling Up Vision-Language Pretraining for Image Captioning.
CVPR
(2022)
Zhiyuan Fang
,
Jianfeng Wang
,
Xiaowei Hu
,
Lijuan Wang
,
Yezhou Yang
,
Zicheng Liu
Compressing Visual-linguistic Model via Knowledge Distillation.
CoRR
(2021)
Xiaowei Hu
,
Zhe Gan
,
Jianfeng Wang
,
Zhengyuan Yang
,
Zicheng Liu
,
Yumao Lu
,
Lijuan Wang
Scaling Up Vision-Language Pre-training for Image Captioning.
CoRR
(2021)
Jianfeng Wang
,
Xiaowei Hu
,
Zhe Gan
,
Zhengyuan Yang
,
Xiyang Dai
,
Zicheng Liu
,
Yumao Lu
,
Lijuan Wang
UFO: A UniFied TransfOrmer for Vision-Language Representation Learning.
CoRR
(2021)
Zhiyuan Fang
,
Jianfeng Wang
,
Xiaowei Hu
,
Lin Liang
,
Zhe Gan
,
Lijuan Wang
,
Yezhou Yang
,
Zicheng Liu
Injecting Semantic Concepts into End-to-End Image Captioning.
CoRR
(2021)
Zhiyuan Fang
,
Jianfeng Wang
,
Xiaowei Hu
,
Lijuan Wang
,
Yezhou Yang
,
Zicheng Liu
Compressing Visual-linguistic Model via Knowledge Distillation.
ICCV
(2021)
Pengchuan Zhang
,
Xiujun Li
,
Xiaowei Hu
,
Jianwei Yang
,
Lei Zhang
,
Lijuan Wang
,
Yejin Choi
,
Jianfeng Gao
VinVL: Making Visual Representations Matter in Vision-Language Models.
CoRR
(2021)
Xiaowei Hu
,
Xi Yin
,
Kevin Lin
,
Lei Zhang
,
Jianfeng Gao
,
Lijuan Wang
,
Zicheng Liu
VIVO: Visual Vocabulary Pre-Training for Novel Object Captioning.
AAAI
(2021)
Zhengyuan Yang
,
Zhe Gan
,
Jianfeng Wang
,
Xiaowei Hu
,
Yumao Lu
,
Zicheng Liu
,
Lijuan Wang
An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA.
CoRR
(2021)
Pengchuan Zhang
,
Xiujun Li
,
Xiaowei Hu
,
Jianwei Yang
,
Lei Zhang
,
Lijuan Wang
,
Yejin Choi
,
Jianfeng Gao
VinVL: Revisiting Visual Representations in Vision-Language Models.
CVPR
(2021)
Zhengyuan Yang
,
Zhe Gan
,
Jianfeng Wang
,
Xiaowei Hu
,
Faisal Ahmed
,
Zicheng Liu
,
Yumao Lu
,
Lijuan Wang
Crossing the Format Boundary of Text and Boxes: Towards Unified Vision-Language Modeling.
CoRR
(2021)
Xiaowei Hu
,
Xi Yin
,
Kevin Lin
,
Lijuan Wang
,
Lei Zhang
,
Jianfeng Gao
,
Zicheng Liu
VIVO: Surpassing Human Performance in Novel Object Captioning with Visual Vocabulary Pre-Training.
CoRR
(2020)
Xiujun Li
,
Xi Yin
,
Chunyuan Li
,
Pengchuan Zhang
,
Xiaowei Hu
,
Lei Zhang
,
Lijuan Wang
,
Houdong Hu
,
Li Dong
,
Furu Wei
,
Yejin Choi
,
Jianfeng Gao
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks.
ECCV (30)
(2020)
Jianfeng Wang
,
Xiaowei Hu
,
Pengchuan Zhang
,
Xiujun Li
,
Lijuan Wang
,
Lei Zhang
,
Jianfeng Gao
,
Zicheng Liu
MiniVLM: A Smaller and Faster Vision-Language Model.
CoRR
(2020)
Xiujun Li
,
Xi Yin
,
Chunyuan Li
,
Pengchuan Zhang
,
Xiaowei Hu
,
Lei Zhang
,
Lijuan Wang
,
Houdong Hu
,
Li Dong
,
Furu Wei
,
Yejin Choi
,
Jianfeng Gao
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks.
CoRR
(2020)