​
Login / Signup
Wanrong Zhu
ORCID
Publication Activity (10 Years)
Years Active: 2018-2024
Publications (10 Years): 43
Top Topics
Street View
Stochastic Gradient Descent
Natural Language
Language Model
Top Venues
CoRR
NeurIPS
EACL (Findings)
NAACL-HLT
</>
Publications
</>
An Yan
,
Zhengyuan Yang
,
Junda Wu
,
Wanrong Zhu
,
Jianwei Yang
,
Linjie Li
,
Kevin Lin
,
Jianfeng Wang
,
Julian J. McAuley
,
Jianfeng Gao
,
Lijuan Wang
List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs.
CoRR
(2024)
Raphael Schumann
,
Wanrong Zhu
,
Weixi Feng
,
Tsu-Jui Fu
,
Stefan Riezler
,
William Yang Wang
VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View.
AAAI
(2024)
Xuehai He
,
Weixi Feng
,
Kaizhi Zheng
,
Yujie Lu
,
Wanrong Zhu
,
Jiachen Li
,
Yue Fan
,
Jianfeng Wang
,
Linjie Li
,
Zhengyuan Yang
,
Kevin Lin
,
William Yang Wang
,
Lijuan Wang
,
Xin Eric Wang
MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos.
CoRR
(2024)
Wanrong Zhu
,
Zhipeng Lou
,
Ziyang Wei
,
Wei Biao Wu
High Confidence Level Inference is Almost Free using Parallel Stochastic Optimization.
CoRR
(2024)
Zekun Li
,
Xianjun Yang
,
Kyuri Choi
,
Wanrong Zhu
,
Ryan Hsieh
,
HyeonJung Kim
,
Jin Hyuk Lim
,
Sungyoung Ji
,
Byungju Lee
,
Xifeng Yan
,
Linda Ruth Petzold
,
Stephen D. Wilson
,
Woosang Lim
,
William Yang Wang
MMSci: A Multimodal Multi-Discipline Dataset for PhD-Level Scientific Comprehension.
CoRR
(2024)
Wanrong Zhu
,
Jennifer Healey
,
Ruiyi Zhang
,
William Yang Wang
,
Tong Sun
Automatic Layout Planning for Visually-Rich Documents with Instruction-Following Models.
CoRR
(2024)
Wanrong Zhu
,
Xin Wang
,
An Yan
,
Miguel P. Eckstein
,
William Yang Wang
ImaginE: An Imagination-Based Automatic Evaluation Metric for Natural Language Generation.
EACL (Findings)
(2023)
Yujie Lu
,
Pan Lu
,
Zhiyu Chen
,
Wanrong Zhu
,
Xin Eric Wang
,
William Yang Wang
Multimodal Procedural Planning via Dual Text-Image Prompting.
CoRR
(2023)
Xinyi Wang
,
Wanrong Zhu
,
Michael Saxon
,
Mark Steyvers
,
William Yang Wang
Large Language Models Are Latent Variable Models: Explaining and Finding Good Demonstrations for In-Context Learning.
NeurIPS
(2023)
Xinyi Wang
,
Wanrong Zhu
,
William Yang Wang
Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Learning.
CoRR
(2023)
Wanrong Zhu
,
An Yan
,
Yujie Lu
,
Wenda Xu
,
Xin Wang
,
Miguel P. Eckstein
,
William Yang Wang
Visualize Before You Write: Imagination-Guided Open-Ended Text Generation.
EACL (Findings)
(2023)
Wanrong Zhu
,
Jack Hessel
,
Anas Awadalla
,
Samir Yitzhak Gadre
,
Jesse Dodge
,
Alex Fang
,
Youngjae Yu
,
Ludwig Schmidt
,
William Yang Wang
,
Yejin Choi
Multimodal C4: An Open, Billion-scale Corpus of Images Interleaved with Text.
NeurIPS
(2023)
Wanrong Zhu
,
Xinyi Wang
,
Yujie Lu
,
Tsu-Jui Fu
,
Xin Eric Wang
,
Miguel P. Eckstein
,
William Yang Wang
Collaborative Generative AI: Integrating GPT-k for Efficient Editing in Text-to-Image Generation.
CoRR
(2023)
Wanrong Zhu
,
Jack Hessel
,
Anas Awadalla
,
Samir Yitzhak Gadre
,
Jesse Dodge
,
Alex Fang
,
Youngjae Yu
,
Ludwig Schmidt
,
William Yang Wang
,
Yejin Choi
Multimodal C4: An Open, Billion-scale Corpus of Images Interleaved With Text.
CoRR
(2023)
Anas Awadalla
,
Irena Gao
,
Josh Gardner
,
Jack Hessel
,
Yusuf Hanafy
,
Wanrong Zhu
,
Kalyani Marathe
,
Yonatan Bitton
,
Samir Yitzhak Gadre
,
Shiori Sagawa
,
Jenia Jitsev
,
Simon Kornblith
,
Pang Wei Koh
,
Gabriel Ilharco
,
Mitchell Wortsman
,
Ludwig Schmidt
OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models.
CoRR
(2023)
Yonatan Bitton
,
Hritik Bansal
,
Jack Hessel
,
Rulin Shao
,
Wanrong Zhu
,
Anas Awadalla
,
Josh Gardner
,
Rohan Taori
,
Ludwig Schmidt
VisIT-Bench: A Dynamic Benchmark for Evaluating Instruction-Following Vision-and-Language Models.
NeurIPS
(2023)
Weixi Feng
,
Wanrong Zhu
,
Tsu-Jui Fu
,
Varun Jampani
,
Arjun R. Akula
,
Xuehai He
,
Sugato Basu
,
Xin Eric Wang
,
William Yang Wang
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models.
NeurIPS
(2023)
An Yan
,
Zhengyuan Yang
,
Wanrong Zhu
,
Kevin Lin
,
Linjie Li
,
Jianfeng Wang
,
Jianwei Yang
,
Yiwu Zhong
,
Julian J. McAuley
,
Jianfeng Gao
,
Zicheng Liu
,
Lijuan Wang
GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation.
CoRR
(2023)
Weixi Feng
,
Wanrong Zhu
,
Tsu-Jui Fu
,
Varun Jampani
,
Arjun R. Akula
,
Xuehai He
,
Sugato Basu
,
Xin Eric Wang
,
William Yang Wang
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models.
CoRR
(2023)
Yujie Lu
,
Weixi Feng
,
Wanrong Zhu
,
Wenda Xu
,
Xin Eric Wang
,
Miguel P. Eckstein
,
William Yang Wang
Neuro-Symbolic Procedural Planning with Commonsense Prompting.
ICLR
(2023)
Ziyang Wei
,
Wanrong Zhu
,
Wei Biao Wu
Weighted Averaged Stochastic Gradient Descent: Asymptotic Normality and Optimality.
CoRR
(2023)
Raphael Schumann
,
Wanrong Zhu
,
Weixi Feng
,
Tsu-Jui Fu
,
Stefan Riezler
,
William Yang Wang
VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View.
CoRR
(2023)
Wanrong Zhu
,
Xinyi Wang
,
Yujie Lu
,
Tsu-Jui Fu
,
Xin Wang
,
Miguel P. Eckstein
,
William Wang
Collaborative Generative AI: Integrating GPT-k for Efficient Editing in Text-to-Image Generation.
EMNLP
(2023)
Yonatan Bitton
,
Hritik Bansal
,
Jack Hessel
,
Rulin Shao
,
Wanrong Zhu
,
Anas Awadalla
,
Josh Gardner
,
Rohan Taori
,
Ludwig Schmidt
VisIT-Bench: A Benchmark for Vision-Language Instruction Following Inspired by Real-World Use.
CoRR
(2023)
Wanrong Zhu
,
Zhipeng Lou
,
Wei Biao Wu
Beyond Sub-Gaussian Noises: Sharp Concentration Analysis for Stochastic Gradient Descent.
J. Mach. Learn. Res.
23 (2022)
Yujie Lu
,
Weixi Feng
,
Wanrong Zhu
,
Wenda Xu
,
Xin Eric Wang
,
Miguel P. Eckstein
,
William Yang Wang
Neuro-Symbolic Causal Language Planning with Commonsense Prompting.
CoRR
(2022)
Wanrong Zhu
,
An Yan
,
Yujie Lu
,
Wenda Xu
,
Xin Eric Wang
,
Miguel Eckstein
,
William Yang Wang
Visualize Before You Write: Imagination-Guided Open-Ended Text Generation.
CoRR
(2022)
Yujie Lu
,
Wanrong Zhu
,
Xin Wang
,
Miguel Eckstein
,
William Yang Wang
Imagination-Augmented Natural Language Understanding.
NAACL-HLT
(2022)
Wanrong Zhu
,
Yuankai Qi
,
Pradyumna Narayana
,
Kazoo Sone
,
Sugato Basu
,
Xin Wang
,
Qi Wu
,
Miguel P. Eckstein
,
William Yang Wang
Diagnosing Vision-and-Language Navigation: What Really Matters.
NAACL-HLT
(2022)
An Yan
,
Jiacheng Li
,
Wanrong Zhu
,
Yujie Lu
,
William Yang Wang
,
Julian J. McAuley
CLIP also Understands Text: Prompting CLIP for Phrase Understanding.
CoRR
(2022)
Wanrong Zhu
,
Bo Pang
,
Ashish V. Thapliyal
,
William Yang Wang
,
Radu Soricut
End-to-end Dense Video Captioning as Sequence Generation.
COLING
(2022)
Wanrong Zhu
,
Bo Pang
,
Ashish V. Thapliyal
,
William Yang Wang
,
Radu Soricut
End-to-end Dense Video Captioning as Sequence Generation.
CoRR
(2022)
Yujie Lu
,
Wanrong Zhu
,
Xin Eric Wang
,
Miguel P. Eckstein
,
William Yang Wang
Imagination-Augmented Natural Language Understanding.
CoRR
(2022)
Wanrong Zhu
,
Xin Eric Wang
,
An Yan
,
Miguel P. Eckstein
,
William Yang Wang
ImaginE: An Imagination-Based Automatic Evaluation Metric for Natural Language Generation.
CoRR
(2021)
Wanrong Zhu
,
Xin Wang
,
Tsu-Jui Fu
,
An Yan
,
Pradyumna Narayana
,
Kazoo Sone
,
Sugato Basu
,
William Yang Wang
Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation.
EACL
(2021)
Wanrong Zhu
,
Yuankai Qi
,
Pradyumna Narayana
,
Kazoo Sone
,
Sugato Basu
,
Xin Eric Wang
,
Qi Wu
,
Miguel P. Eckstein
,
William Yang Wang
Diagnosing Vision-and-Language Navigation: What Really Matters.
CoRR
(2021)
Wanrong Zhu
,
Xin Wang
,
Tsu-Jui Fu
,
An Yan
,
Pradyumna Narayana
,
Kazoo Sone
,
Sugato Basu
,
William Yang Wang
Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation.
CoRR
(2020)
Wanrong Zhu
,
Xin Eric Wang
,
Pradyumna Narayana
,
Kazoo Sone
,
Sugato Basu
,
William Yang Wang
Towards Understanding Sample Variance in Visually Grounded Language Generation: Evaluations and Observations.
CoRR
(2020)
Wanrong Zhu
,
Xi Chen
,
Wei Biao Wu
A Fully Online Approach for Covariance Matrices Estimation of Stochastic Gradient Descent Solutions.
CoRR
(2020)
Wanrong Zhu
,
Xin Wang
,
Pradyumna Narayana
,
Kazoo Sone
,
Sugato Basu
,
William Yang Wang
Towards Understanding Sample Variance in Visually Grounded Language Generation: Evaluations and Observations.
EMNLP (1)
(2020)
Zhiting Hu
,
Haoran Shi
,
Bowen Tan
,
Wentao Wang
,
Zichao Yang
,
Tiancheng Zhao
,
Junxian He
,
Lianhui Qin
,
Di Wang
,
Xuezhe Ma
,
Zhengzhong Liu
,
Xiaodan Liang
,
Wanrong Zhu
,
Devendra Singh Sachan
,
Eric P. Xing
Texar: A Modularized, Versatile, and Extensible Toolkit for Text Generation.
ACL (3)
(2019)
Wanrong Zhu
,
Zhiting Hu
,
Eric P. Xing
Text Infilling.
CoRR
(2019)
Zhiting Hu
,
Haoran Shi
,
Zichao Yang
,
Bowen Tan
,
Tiancheng Zhao
,
Junxian He
,
Wentao Wang
,
Xingjiang Yu
,
Lianhui Qin
,
Di Wang
,
Xuezhe Ma
,
Zhengzhong Liu
,
Xiaodan Liang
,
Wanrong Zhu
,
Devendra Singh Sachan
,
Eric P. Xing
Texar: A Modularized, Versatile, and Extensible Toolkit for Text Generation.
CoRR
(2018)