Weixi Feng
Publication Activity (10 Years)
Years Active: 2022-2024
Publications (10 Years): 20
Top Topics
Image Search
Temporal Consistency
Language Learners
Diffusion Models
Top Venues
CoRR
EMNLP
ICLR
AAAI
Publications
Jiachen Li, Weixi Feng, Wenhu Chen, William Yang Wang
Reward Guided Latent Consistency Distillation.
CoRR (2024)

Raphael Schumann, Wanrong Zhu, Weixi Feng, Tsu-Jui Fu, Stefan Riezler, William Yang Wang
VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View.
AAAI (2024)

Xuehai He, Weixi Feng, Kaizhi Zheng, Yujie Lu, Wanrong Zhu, Jiachen Li, Yue Fan, Jianfeng Wang, Linjie Li, Zhengyuan Yang, Kevin Lin, William Yang Wang, Lijuan Wang, Xin Eric Wang
MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos.
CoRR (2024)

Jiachen Li, Weixi Feng, Tsu-Jui Fu, Xinyi Wang, Sugato Basu, Wenhu Chen, William Yang Wang
T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback.
CoRR (2024)

Weixi Feng, Jiachen Li, Michael Saxon, Tsu-Jui Fu, Wenhu Chen, William Yang Wang
TC-Bench: Benchmarking Temporal Compositionality in Text-to-Video and Image-to-Video Generation.
CoRR (2024)

Xuehai He, Weixi Feng, Tsu-Jui Fu, Varun Jampani, Arjun R. Akula, Pradyumna Narayana, Sugato Basu, William Yang Wang, Xin Eric Wang
Discriminative Diffusion Models as Few-shot Vision and Language Learners.
CoRR (2023)

Siqi Liu, Weixi Feng, Wenhu Chen, William Yang Wang
EDIS: Entity-Driven Image Search over Multimodal Web Content.
CoRR (2023)

Siqi Liu, Weixi Feng, Tsu-Jui Fu, Wenhu Chen, William Wang
EDIS: Entity-Driven Image Search over Multimodal Web Content.
EMNLP (2023)

Weixi Feng, Wanrong Zhu, Tsu-Jui Fu, Varun Jampani, Arjun R. Akula, Xuehai He, Sugato Basu, Xin Eric Wang, William Yang Wang
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models.
NeurIPS (2023)

Weixi Feng, Wanrong Zhu, Tsu-Jui Fu, Varun Jampani, Arjun R. Akula, Xuehai He, Sugato Basu, Xin Eric Wang, William Yang Wang
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models.
CoRR (2023)

Yujie Lu, Weixi Feng, Wanrong Zhu, Wenda Xu, Xin Eric Wang, Miguel P. Eckstein, William Yang Wang
Neuro-Symbolic Procedural Planning with Commonsense Prompting.
ICLR (2023)

Weixi Feng, Xuehai He, Tsu-Jui Fu, Varun Jampani, Arjun R. Akula, Pradyumna Narayana, Sugato Basu, Xin Eric Wang, William Yang Wang
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis.
ICLR (2023)

Raphael Schumann, Wanrong Zhu, Weixi Feng, Tsu-Jui Fu, Stefan Riezler, William Yang Wang
VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View.
CoRR (2023)

Yujie Lu, Weixi Feng, Wanrong Zhu, Wenda Xu, Xin Eric Wang, Miguel P. Eckstein, William Yang Wang
Neuro-Symbolic Causal Language Planning with Commonsense Prompting.
CoRR (2022)

Weixi Feng, Tsu-Jui Fu, Yujie Lu, William Yang Wang
ULN: Towards Underspecified Vision-and-Language Navigation.
CoRR (2022)

Yujie Lu, Huiliang Zhang, Ping Nie, Weixi Feng, Wenda Xu, Xin Eric Wang, William Yang Wang
Anticipating the Unseen Discrepancy for Vision and Language Navigation.
CoRR (2022)

Weixi Feng, Tsu-Jui Fu, Yujie Lu, William Yang Wang
ULN: Towards Underspecified Vision-and-Language Navigation.
EMNLP (2022)

Xuehai He, Diji Yang, Weixi Feng, Tsu-Jui Fu, Arjun R. Akula, Varun Jampani, Pradyumna Narayana, Sugato Basu, William Yang Wang, Xin Eric Wang
CPL: Counterfactual Prompt Learning for Vision and Language Models.
CoRR (2022)

Weixi Feng, Xuehai He, Tsu-Jui Fu, Varun Jampani, Arjun R. Akula, Pradyumna Narayana, Sugato Basu, Xin Eric Wang, William Yang Wang
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis.
CoRR (2022)

Xuehai He, Diji Yang, Weixi Feng, Tsu-Jui Fu, Arjun R. Akula, Varun Jampani, Pradyumna Narayana, Sugato Basu, William Yang Wang, Xin Wang
CPL: Counterfactual Prompt Learning for Vision and Language Models.
EMNLP (2022)