​
Login / Signup
Wei Ji
ORCID
Publication Activity (10 Years)
Years Active: 2018-2024
Publications (10 Years): 58
Top Topics
Saliency Detection
Autonomous Driving
Language Model
Depth Images
Top Venues
CoRR
AAAI
IEEE Trans. Image Process.
CVPR
</>
Publications
</>
Meng Wei
,
Long Chen
,
Wei Ji
,
Xiaoyu Yue
,
Roger Zimmermann
In Defense of Clip-Based Video Relation Detection.
IEEE Trans. Image Process.
33 (2024)
Li Li
,
You Qin
,
Wei Ji
,
Yuxiao Zhou
,
Roger Zimmermann
Domain-Wise Invariant Learning for Panoptic Scene Graph Generation.
ICASSP
(2024)
Yiyang Chen
,
Zhedong Zheng
,
Wei Ji
,
Leigang Qu
,
Tat-Seng Chua
Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization.
ICLR
(2024)
Juncheng Li
,
Kaihang Pan
,
Zhiqi Ge
,
Minghe Gao
,
Wei Ji
,
Wenqiao Zhang
,
Tat-Seng Chua
,
Siliang Tang
,
Hanwang Zhang
,
Yueting Zhuang
Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions.
ICLR
(2024)
Wei Ji
,
Li Li
,
Zheqi Lv
,
Wenqiao Zhang
,
Mengze Li
,
Zhen Wan
,
Wenqiang Lei
,
Roger Zimmermann
Backpropogation-Free Multi-modal On-Device Model Adaptation via Cloud-Device Collaboration.
CoRR
(2024)
Wei Ji
,
Xiangyan Liu
,
Yingfei Sun
,
Jiajun Deng
,
You Qin
,
Ammar Nuwanna
,
Mengyao Qiu
,
Lina Wei
,
Roger Zimmermann
Described Spatial-Temporal Video Detection.
CoRR
(2024)
Li Li
,
Wei Ji
,
Yiming Wu
,
Mengze Li
,
You Qin
,
Lina Wei
,
Roger Zimmermann
Panoptic Scene Graph Generation with Semantics-Prototype Learning.
AAAI
(2024)
Jiahang Tu
,
Wei Ji
,
Hanbin Zhao
,
Chao Zhang
,
Roger Zimmermann
,
Hui Qian
DriveDiTFit: Fine-tuning Diffusion Transformers for Autonomous Driving.
CoRR
(2024)
Wei Ji
,
You Qin
,
Long Chen
,
Yinwei Wei
,
Yiming Wu
,
Roger Zimmermann
Mrtnet: Multi-Resolution Temporal Network for Video Sentence Grounding.
ICASSP
(2024)
Kaihang Pan
,
Juncheng Li
,
Wenjie Wang
,
Hao Fei
,
Hongye Song
,
Wei Ji
,
Jun Lin
,
Xiaozhong Liu
,
Tat-Seng Chua
,
Siliang Tang
I3: Intent-Introspective Retrieval Conditioned on Instructions.
SIGIR
(2024)
Wei Ji
,
Ruiqi Shi
,
Yinwei Wei
,
Shanshan Zhao
,
Roger Zimmermann
Weakly Supervised Video Moment Retrieval via Location-irrelevant Proposal Learning.
WWW (Companion Volume)
(2024)
Wei Ji
,
Renjie Liang
,
Zhedong Zheng
,
Wenqiao Zhang
,
Shengyu Zhang
,
Juncheng Li
,
Mengze Li
,
Tat-Seng Chua
Are Binary Annotations Sufficient? Video Moment Retrieval via Hierarchical Uncertainty-based Active Learning.
CVPR
(2023)
Shengqiong Wu
,
Hao Fei
,
Wei Ji
,
Tat-Seng Chua
Cross2StrA: Unpaired Cross-lingual Image Captioning with Cross-lingual Cross-modal Structure-pivoted Alignment.
ACL (1)
(2023)
Shengyu Zhang
,
Xusheng Feng
,
Wenyan Fan
,
Wenjing Fang
,
Fuli Feng
,
Wei Ji
,
Shuo Li
,
Li Wang
,
Shanshan Zhao
,
Zhou Zhao
,
Tat-Seng Chua
,
Fei Wu
Video-Audio Domain Generalization via Confounder Disentanglement.
AAAI
(2023)
Minghe Gao
,
Juncheng Li
,
Hao Fei
,
Liang Pang
,
Wei Ji
,
Guoming Wang
,
Wenqiao Zhang
,
Siliang Tang
,
Yueting Zhuang
De-fine: Decomposing and Refining Visual Programs with Auto-Feedback.
CoRR
(2023)
Peng Qi
,
Yuyang Zhao
,
Yufeng Shen
,
Wei Ji
,
Juan Cao
,
Tat-Seng Chua
Two Heads Are Better Than One: Improving Fake News Video Detection by Correlating with Neighbors.
CoRR
(2023)
Meng Chu
,
Zhedong Zheng
,
Wei Ji
,
Tat-Seng Chua
Towards Natural Language-Guided Drones: GeoText-1652 Benchmark with Spatially Relation Matching.
CoRR
(2023)
Peng Qi
,
Yuyan Bu
,
Juan Cao
,
Wei Ji
,
Ruihao Shui
,
Junbin Xiao
,
Danding Wang
,
Tat-Seng Chua
FakeSV: A Multimodal Benchmark with Rich Social Context for Fake News Detection on Short Video Platforms.
AAAI
(2023)
Meng Wei
,
Long Chen
,
Wei Ji
,
Xiaoyu Yue
,
Roger Zimmermann
In Defense of Clip-based Video Relation Detection.
CoRR
(2023)
Wei Ji
,
Renjie Liang
,
Lizi Liao
,
Hao Fei
,
Fuli Feng
Partial Annotation-based Video Moment Retrieval via Iterative Learning.
ACM Multimedia
(2023)
Ao Zhang
,
Wei Ji
,
Tat-Seng Chua
NExT-Chat: An LMM for Chat, Detection and Segmentation.
CoRR
(2023)
Ao Zhang
,
Hao Fei
,
Yuan Yao
,
Wei Ji
,
Li Li
,
Zhiyuan Liu
,
Tat-Seng Chua
VPGTrans: Transfer Visual Prompt Generator across LLMs.
NeurIPS
(2023)
Mengze Li
,
Han Wang
,
Wenqiao Zhang
,
Jiaxu Miao
,
Zhou Zhao
,
Shengyu Zhang
,
Wei Ji
,
Fei Wu
WINNER: Weakly-supervised hIerarchical decompositioN and aligNment for spatio-tEmporal video gRounding.
CVPR
(2023)
Juncheng Li
,
Kaihang Pan
,
Zhiqi Ge
,
Minghe Gao
,
Hanwang Zhang
,
Wei Ji
,
Wenqiao Zhang
,
Tat-Seng Chua
,
Siliang Tang
,
Yueting Zhuang
Empowering Vision-Language Models to Follow Interleaved Vision-Language Instructions.
CoRR
(2023)
Shengqiong Wu
,
Hao Fei
,
Leigang Qu
,
Wei Ji
,
Tat-Seng Chua
NExT-GPT: Any-to-Any Multimodal LLM.
CoRR
(2023)
Wei Ji
,
Yinwei Wei
,
Zhedong Zheng
,
Hao Fei
,
Tat-Seng Chua
Deep Multimodal Learning for Information Retrieval.
ACM Multimedia
(2023)
Qifan Yu
,
Juncheng Li
,
Yu Wu
,
Siliang Tang
,
Wei Ji
,
Yueting Zhuang
Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World.
ICCV
(2023)
Yali Du
,
Yinwei Wei
,
Wei Ji
,
Fan Liu
,
Xin Luo
,
Liqiang Nie
Multi-queue Momentum Contrast for Microvideo-Product Retrieval.
WSDM
(2023)
Yu Zhao
,
Hao Fei
,
Wei Ji
,
Jianguo Wei
,
Meishan Zhang
,
Min Zhang
,
Tat-Seng Chua
Generating Visual Spatial Description via Holistic 3D Scene Understanding.
ACL (1)
(2023)
Juncheng Li
,
Minghe Gao
,
Longhui Wei
,
Siliang Tang
,
Wenqiao Zhang
,
Mengze Li
,
Wei Ji
,
Qi Tian
,
Tat-Seng Chua
,
Yueting Zhuang
Gradient-Regulated Meta-Prompt Learning for Generalizable Vision-Language Models.
CoRR
(2023)
Wei Ji
,
Li Li
,
Hao Fei
,
Xiangyan Liu
,
Xun Yang
,
Juncheng Li
,
Roger Zimmermann
Towards Complex-query Referring Image Segmentation: A Novel Benchmark.
CoRR
(2023)
Li Li
,
Wei Ji
,
Yiming Wu
,
Mengze Li
,
You Qin
,
Lina Wei
,
Roger Zimmermann
Panoptic Scene Graph Generation with Semantics-prototype Learning.
CoRR
(2023)
Hao Fei
,
Shengqiong Wu
,
Wei Ji
,
Hanwang Zhang
,
Tat-Seng Chua
Empowering Dynamics-aware Text-to-Video Diffusion with Large Language Models.
CoRR
(2023)
Peng Qi
,
Yuyang Zhao
,
Yufeng Shen
,
Wei Ji
,
Juan Cao
,
Tat-Seng Chua
Two Heads Are Better Than One: Improving Fake News Video Detection by Correlating with Neighbors.
ACL (Findings)
(2023)
Qifan Yu
,
Juncheng Li
,
Yu Wu
,
Siliang Tang
,
Wei Ji
,
Yueting Zhuang
Visually-Prompted Language Model for Fine-Grained Scene Graph Generation in an Open World.
CoRR
(2023)
Kaihang Pan
,
Juncheng Li
,
Hongye Song
,
Hao Fei
,
Wei Ji
,
Shuo Zhang
,
Jun Lin
,
Xiaozhong Liu
,
Siliang Tang
ControlRetriever: Harnessing the Power of Instructions for Controllable Retrieval.
CoRR
(2023)
Feifei Shao
,
Long Chen
,
Jian Shao
,
Wei Ji
,
Shaoning Xiao
,
Lu Ye
,
Yueting Zhuang
,
Jun Xiao
Deep Learning for Weakly-Supervised Object Detection and Localization: A Survey.
Neurocomputing
496 (2022)
Yuan Yao
,
Qianyu Chen
,
Ao Zhang
,
Wei Ji
,
Zhiyuan Liu
,
Tat-Seng Chua
,
Maosong Sun
PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models.
EMNLP
(2022)
Ao Zhang
,
Yuan Yao
,
Qianyu Chen
,
Wei Ji
,
Zhiyuan Liu
,
Maosong Sun
,
Tat-Seng Chua
Fine-Grained Scene Graph Generation with Data Transfer.
CoRR
(2022)
Yaoyao Zhong
,
Wei Ji
,
Junbin Xiao
,
Yicong Li
,
Weihong Deng
,
Tat-Seng Chua
Video Question Answering: Datasets, Algorithms and Challenges.
EMNLP
(2022)
Wei Ji
,
Long Chen
,
Yinwei Wei
,
Yiming Wu
,
Tat-Seng Chua
MRTNet: Multi-Resolution Temporal Network for Video Sentence Grounding.
CoRR
(2022)
Meng Wei
,
Long Chen
,
Wei Ji
,
Xiaoyu Yue
,
Tat-Seng Chua
Rethinking the Two-Stage Framework for Grounded Situation Recognition.
AAAI
(2022)
Zhedong Zheng
,
Jiayin Zhu
,
Wei Ji
,
Yi Yang
,
Tat-Seng Chua
3D Magic Mirror: Clothing Reconstruction from a Single Image via a Causal Perspective.
CoRR
(2022)
Junbin Xiao
,
Angela Yao
,
Zhiyuan Liu
,
Yicong Li
,
Wei Ji
,
Tat-Seng Chua
Video as Conditional Graph Hierarchy for Multi-Granular Question Answering.
AAAI
(2022)
Ao Zhang
,
Yuan Yao
,
Qianyu Chen
,
Wei Ji
,
Zhiyuan Liu
,
Maosong Sun
,
Tat-Seng Chua
Fine-Grained Scene Graph Generation with Data Transfer.
ECCV (27)
(2022)
Yuan Yao
,
Qianyu Chen
,
Ao Zhang
,
Wei Ji
,
Zhiyuan Liu
,
Tat-Seng Chua
,
Maosong Sun
PEVL: Position-enhanced Pre-training and Prompt Tuning for Vision-language Models.
CoRR
(2022)
Yicong Li
,
Xiang Wang
,
Junbin Xiao
,
Wei Ji
,
Tat-Seng Chua
Invariant Grounding for Video Question Answering.
CVPR
(2022)
Guanghao Yin
,
Wei Wang
,
Zehuan Yuan
,
Chuchu Han
,
Wei Ji
,
Shouqian Sun
,
Changhu Wang
Content-Variant Reference Image Quality Assessment via Knowledge Distillation.
AAAI
(2022)
Guanghao Yin
,
Wei Wang
,
Zehuan Yuan
,
Wei Ji
,
Dongdong Yu
,
Shouqian Sun
,
Tat-Seng Chua
,
Changhu Wang
Conditional Hyper-Network for Blind Super-Resolution With Multiple Degradations.
IEEE Trans. Image Process.
31 (2022)
Yiyang Chen
,
Zhedong Zheng
,
Wei Ji
,
Leigang Qu
,
Tat-Seng Chua
Composed Image Retrieval with Text Feedback via Multi-grained Uncertainty Regularization.
CoRR
(2022)
Yicong Li
,
Xiang Wang
,
Junbin Xiao
,
Wei Ji
,
Tat-Seng Chua
Invariant Grounding for Video Question Answering.
CoRR
(2022)
Chenchen Ye
,
Lizi Liao
,
Fuli Feng
,
Wei Ji
,
Tat-Seng Chua
Structured and Natural Responses Co-generation for Conversational Search.
SIGIR
(2022)
Peng Qi
,
Yuyan Bu
,
Juan Cao
,
Wei Ji
,
Ruihao Shui
,
Junbin Xiao
,
Danding Wang
,
Tat-Seng Chua
FakeSV: A Multimodal Benchmark with Rich Social Context for Fake News Detection on Short Video Platforms.
CoRR
(2022)
Yiming Wu
,
Wei Ji
,
Xi Li
,
Gang Wang
,
Jianwei Yin
,
Fei Wu
Context-Aware Deep Spatiotemporal Network for Hand Pose Estimation From Depth Images.
IEEE Trans. Cybern.
50 (2) (2020)
Wei Ji
,
Xi Li
,
Lina Wei
,
Fei Wu
,
Yueting Zhuang
Context-Aware Graph Label Propagation Network for Saliency Detection.
IEEE Trans. Image Process.
29 (2020)
Wei Ji
,
Xi Li
,
Fei Wu
,
Zhijie Pan
,
Yueting Zhuang
Human-Centric Clothing Segmentation via Deformable Semantic Locality-Preserving Network.
IEEE Trans. Circuits Syst. Video Technol.
30 (12) (2020)
Xi Li
,
Liming Zhao
,
Wei Ji
,
Yiming Wu
,
Fei Wu
,
Ming-Hsuan Yang
,
Dacheng Tao
,
Ian Reid
Multi-Task Structure-Aware Context Modeling for Robust Keypoint-Based Object Tracking.
IEEE Trans. Pattern Anal. Mach. Intell.
41 (4) (2019)
Yiming Wu
,
Wei Ji
,
Xi Li
,
Gang Wang
,
Jianwei Yin
,
Fei Wu
Context-Aware Deep Spatio-Temporal Network for Hand Pose Estimation from Depth Images.
CoRR
(2018)