​
Login / Signup
Zaid Khan
ORCID
Publication Activity (10 Years)
Years Active: 2020-2024
Publications (10 Years): 18
Top Topics
Sentiment Classification
Object Instances
Language Model
Scene Analysis
Top Venues
CoRR
FAccT
ACM Multimedia
ICLR
</>
Publications
</>
Zaid Khan
,
Vijay Kumar B. G
,
Samuel Schulter
,
Yun Fu
,
Manmohan Chandraker
Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement.
CoRR
(2024)
Zaid Khan
,
Yun Fu
Consistency and Uncertainty: Identifying Unreliable Responses From Black-Box Vision-Language Models for Selective Visual Question Answering.
CoRR
(2024)
Zaid Khan
,
BG Vijay Kumar
,
Samuel Schulter
,
Xiang Yu
,
Yun Fu
,
Manmohan Chandraker
Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!
CVPR
(2023)
Zaid Khan
,
Yun Fu
Contrastive Alignment of Vision to Language Through Parameter-Efficient Transfer Learning.
ICLR
(2023)
Zaid Khan
,
Vijay Kumar B. G
,
Samuel Schulter
,
Manmohan Chandraker
,
Yun Fu
Exploring Question Decomposition for Zero-Shot VQA.
CoRR
(2023)
Zaid Khan
,
Vijay Kumar B. G
,
Samuel Schulter
,
Xiang Yu
,
Yun Fu
,
Manmohan Chandraker
Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!
CoRR
(2023)
Zaid Khan
,
Yun Fu
Contrastive Alignment of Vision to Language Through Parameter-Efficient Transfer Learning.
CoRR
(2023)
Zaid Khan
,
Vijay Kumar B. G
,
Samuel Schulter
,
Manmohan Chandraker
,
Yun Fu
Exploring Question Decomposition for Zero-Shot VQA.
NeurIPS
(2023)
Zaid Khan
,
B. G. Vijay Kumar
,
Xiang Yu
,
Samuel Schulter
,
Manmohan Chandraker
,
Yun Fu
Single-Stream Multi-level Alignment for Vision-Language Pretraining.
ECCV (36)
(2022)
Zaid Khan
,
Vijay Kumar B. G
,
Xiang Yu
,
Samuel Schulter
,
Manmohan Chandraker
,
Yun Fu
Single-Stream Multi-Level Alignment for Vision-Language Pretraining.
CoRR
(2022)
Joseph P. Robinson
,
Zaid Khan
,
Yu Yin
,
Ming Shao
,
Yun Fu
Families in Wild Multimedia: A Multimodal Database for Recognizing Kinship.
IEEE Trans. Multim.
24 (2022)
Zaid Khan
,
Yun Fu
Exploiting BERT For Multimodal Target Sentiment Classification Through Input Space Translation.
CoRR
(2021)
Zaid Khan
,
Yun Fu
One Label, One Billion Faces: Usage and Consistency of Racial Categories in Computer Vision.
CoRR
(2021)
Zaid Khan
,
Yun Fu
Exploiting BERT for Multimodal Target Sentiment Classification through Input Space Translation.
ACM Multimedia
(2021)
Zaid Khan
,
Yun Fu
One Label, One Billion Faces: Usage and Consistency of Racial Categories in Computer Vision.
FAccT
(2021)
Joseph P. Robinson
,
Yu Yin
,
Zaid Khan
,
Ming Shao
,
Siyu Xia
,
Michael Stopa
,
Samson Timoner
,
Matthew A. Turk
,
Rama Chellappa
,
Yun Fu
Recognizing Families In the Wild (RFIW): The 4th Edition.
FG
(2020)
Joseph P. Robinson
,
Zaid Khan
,
Yu Yin
,
Ming Shao
,
Yun Fu
Families In Wild Multimedia (FIW-MM): A Multi-Modal Database for Recognizing Kinship.
CoRR
(2020)
Joseph P. Robinson
,
Yu Yin
,
Zaid Khan
,
Ming Shao
,
Siyu Xia
,
Michael Stopa
,
Samson Timoner
,
Matthew A. Turk
,
Rama Chellappa
,
Yun Fu
Recognizing Families In the Wild (RFIW): The 4th Edition.
CoRR
(2020)