Login / Signup
Deyao Zhu
ORCID
Publication Activity (10 Years)
Years Active: 2019-2024
Publications (10 Years): 21
Top Topics
Boltzmann Machine
Maximum Likelihood Estimation
Reinforcement Learning
Language Model
Top Venues
CoRR
ICLR
ECCV (22)
GCPR
</>
Publications
</>
Kirolos Ataallah
,
Xiaoqian Shen
,
Eslam Abdelrahman
,
Essam Sleiman
,
Mingchen Zhuge
,
Jian Ding
,
Deyao Zhu
,
Jürgen Schmidhuber
,
Mohamed Elhoseiny
Goldfish: Vision-Language Understanding of Arbitrarily Long Videos.
CoRR
(2024)
Kirolos Ataallah
,
Xiaoqian Shen
,
Eslam Abdelrahman
,
Essam Sleiman
,
Deyao Zhu
,
Jian Ding
,
Mohamed Elhoseiny
MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens.
CoRR
(2024)
Deyao Zhu
,
Jun Chen
,
Kilichbek Haydarov
,
Xiaoqian Shen
,
Wenxuan Zhang
,
Mohamed Elhoseiny
ChatGPT Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched Visual Descriptions.
Trans. Mach. Learn. Res.
2024 (2024)
Deyao Zhu
,
Jun Chen
,
Xiaoqian Shen
,
Xiang Li
,
Mohamed Elhoseiny
MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models.
ICLR
(2024)
Asma Alkhaldi
,
Raneem Alnajim
,
Layan Alabdullatef
,
Rawan Alyahya
,
Jun Chen
,
Deyao Zhu
,
Ahmed Alsinan
,
Mohamed Elhoseiny
MiniGPT-Med: Large Language Model as a General Interface for Radiology Diagnosis.
CoRR
(2024)
Deyao Zhu
,
Jun Chen
,
Xiaoqian Shen
,
Xiang Li
,
Mohamed Elhoseiny
MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models.
CoRR
(2023)
Jun Chen
,
Deyao Zhu
,
Guocheng Qian
,
Bernard Ghanem
,
Zhicheng Yan
,
Chenchen Zhu
,
Fanyi Xiao
,
Sean Chang Culatana
,
Mohamed Elhoseiny
Exploring Open-Vocabulary Semantic Segmentation from CLIP Vision Encoder Distillation Only.
ICCV
(2023)
Deyao Zhu
,
Jun Chen
,
Kilichbek Haydarov
,
Xiaoqian Shen
,
Wenxuan Zhang
,
Mohamed Elhoseiny
ChatGPT Asks, BLIP-2 Answers: Automatic Questioning Towards Enriched Visual Descriptions.
CoRR
(2023)
Jun Chen
,
Deyao Zhu
,
Guocheng Qian
,
Bernard Ghanem
,
Zhicheng Yan
,
Chenchen Zhu
,
Fanyi Xiao
,
Mohamed Elhoseiny
,
Sean Chang Culatana
Exploring Open-Vocabulary Semantic Segmentation without Human Labels.
CoRR
(2023)
Deyao Zhu
,
Li Erran Li
,
Mohamed Elhoseiny
Value Memory Graph: A Graph-Structured World Model for Offline Reinforcement Learning.
ICLR
(2023)
Jun Chen
,
Deyao Zhu
,
Xiaoqian Shen
,
Xiang Li
,
Zechun Liu
,
Pengchuan Zhang
,
Raghuraman Krishnamoorthi
,
Vikas Chandra
,
Yunyang Xiong
,
Mohamed Elhoseiny
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning.
CoRR
(2023)
Jun Chen
,
Deyao Zhu
,
Kilichbek Haydarov
,
Xiang Li
,
Mohamed Elhoseiny
Video ChatCaptioner: Towards Enriched Spatiotemporal Descriptions.
CoRR
(2023)
Deyao Zhu
,
Yuhui Wang
,
Jürgen Schmidhuber
,
Mohamed Elhoseiny
Guiding Online Reinforcement Learning with Action-Free Offline Pretraining.
CoRR
(2023)
Deyao Zhu
,
Li Erran Li
,
Mohamed Elhoseiny
Value Memory Graph: A Graph-Structured World Model for Offline Reinforcement Learning.
CoRR
(2022)
Abduallah A. Mohamed
,
Deyao Zhu
,
Warren Vu
,
Mohamed Elhoseiny
,
Christian G. Claudel
Social-Implicit: Rethinking Trajectory Prediction Evaluation and The Effectiveness of Implicit Maximum Likelihood Estimation.
ECCV (22)
(2022)
Abduallah A. Mohamed
,
Deyao Zhu
,
Warren Vu
,
Mohamed Elhoseiny
,
Christian G. Claudel
Social-Implicit: Rethinking Trajectory Prediction Evaluation and The Effectiveness of Implicit Maximum Likelihood Estimation.
CoRR
(2022)
Jun Chen
,
Aniket Agarwal
,
Sherif Abdelkarim
,
Deyao Zhu
,
Mohamed Elhoseiny
RelTransformer: A Transformer-Based Long-Tail Visual Relationship Recognition.
CVPR
(2022)
Jun Chen
,
Aniket Agarwal
,
Sherif Abdelkarim
,
Deyao Zhu
,
Mohamed Elhoseiny
RelTransformer: Balancing the Visual Relationship Detection from Local Context, Scene and Memory.
CoRR
(2021)
Deyao Zhu
,
Mohamed Zahran
,
Li Erran Li
,
Mohamed Elhoseiny
HalentNet: Multimodal Trajectory Forecasting with Hallucinative Intents.
ICLR
(2021)
Deyao Zhu
,
Mohamed Zahran
,
Li Erran Li
,
Mohamed Elhoseiny
Motion Forecasting with Unlikelihood Training in Continuous Space.
CoRL
(2021)
Deyao Zhu
,
Marco Munderloh
,
Bodo Rosenhahn
,
Jörg Stückler
Learning to Disentangle Latent Physical Factors for Video Prediction.
GCPR
(2019)