Login / Signup
Vasudev Lal
Publication Activity (10 Years)
Years Active: 2021-2024
Publications (10 Years): 41
Top Topics
Diffusion Model
Term Extraction
Language Model
Trec Collections
Top Venues
CoRR
CIKM
NeurIPS
ACL (1)
</>
Publications
</>
Zhipeng Cai
,
Matthias Mueller
,
Reiner Birkl
,
Diana Wofk
,
Shao-Yen Tseng
,
Junda Cheng
,
Gabriela Ben Melech Stan
,
Vasudev Lal
,
Michael Paulitsch
L-MAGIC: Language Model Assisted Generation of Images with Coherence.
CoRR
(2024)
Gabriela Ben Melech Stan
,
Raanan Y. Yehezkel Rohekar
,
Yaniv Gurwicz
,
Matthew Lyle Olson
,
Anahita Bhiwandiwalla
,
Estelle Aflalo
,
Chenfei Wu
,
Nan Duan
,
Shao-Yen Tseng
,
Vasudev Lal
LVLM-Intrepret: An Interpretability Tool for Large Vision-Language Models.
CoRR
(2024)
Shachar Rosenman
,
Vasudev Lal
,
Phillip Howard
NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation.
EACL (Demonstrations)
(2024)
Musashi Hinck
,
Carolin Holtermann
,
Matthew Lyle Olson
,
Florian Schneider
,
Sungduk Yu
,
Anahita Bhiwandiwalla
,
Anne Lauscher
,
Shao-Yen Tseng
,
Vasudev Lal
Why do LLaVA Vision-Language Models Reply to Images in English?
CoRR
(2024)
Xin Su
,
Man Luo
,
Kris W. Pan
,
Tien Pei Chou
,
Vasudev Lal
,
Phillip Howard
SK-VQA: Synthetic Knowledge Generation at Scale for Training Context-Augmented Multimodal LLMs.
CoRR
(2024)
Musashi Hinck
,
Matthew L. Olson
,
David Cobbley
,
Shao-Yen Tseng
,
Vasudev Lal
LLaVA-Gemma: Accelerating Multimodal Foundation Models with a Compact Language Model.
CoRR
(2024)
Agneet Chatterjee
,
Gabriela Ben Melech Stan
,
Estelle Aflalo
,
Sayak Paul
,
Dhruba Ghosh
,
Tejas Gokhale
,
Ludwig Schmidt
,
Hannaneh Hajishirzi
,
Vasudev Lal
,
Chitta Baral
,
Yezhou Yang
Getting it Right: Improving Spatial Consistency in Text-to-Image Models.
CoRR
(2024)
Avinash Madasu
,
Estelle Aflalo
,
Gabriela Ben Melech Stan
,
Shao-Yen Tseng
,
Gedas Bertasius
,
Vasudev Lal
Improving Video Retrieval Using Multilingual Knowledge Transfer.
ECIR (1)
(2023)
Tiep Le
,
Vasudev Lal
,
Phillip Howard
COCO-Counterfactuals: Automatically Constructed Counterfactual Examples for Image-Text Pairs.
NeurIPS
(2023)
Phillip Howard
,
Junlin Wang
,
Vasudev Lal
,
Gadi Singer
,
Yejin Choi
,
Swabha Swayamdipta
NeuroComparatives: Neuro-Symbolic Distillation of Comparative Knowledge.
CoRR
(2023)
Phillip Howard
,
Avinash Madasu
,
Tiep Le
,
Gustavo A. Lujan-Moreno
,
Vasudev Lal
Probing Intersectional Biases in Vision-Language Models with Counterfactual Examples.
CoRR
(2023)
Xiao Xu
,
Bei Li
,
Chenfei Wu
,
Shao-Yen Tseng
,
Anahita Bhiwandiwalla
,
Shachar Rosenman
,
Vasudev Lal
,
Wanxiang Che
,
Nan Duan
ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning.
ACL (1)
(2023)
Xiao Xu
,
Chenfei Wu
,
Shachar Rosenman
,
Vasudev Lal
,
Wanxiang Che
,
Nan Duan
BridgeTower: Building Bridges between Encoders in Vision-Language Representation Learning.
AAAI
(2023)
Gadi Singer
,
Joscha Bach
,
Tetiana Grinberg
,
Nagib Hakim
,
Phillip Howard
,
Vasudev Lal
,
Zev Rivlin
Thrill-K Architecture: Towards a Solution to the Problem of Knowledge Based Understanding.
CoRR
(2023)
Jerry Tang
,
Meng Du
,
Vy A. Vo
,
Vasudev Lal
,
Alexander Huth
Brain encoding models based on multimodal transformers can transfer across language and vision.
NeurIPS
(2023)
Avinash Madasu
,
Vasudev Lal
Is multi-modal vision supervision beneficial to language?
CoRR
(2023)
Avinash Madasu
,
Estelle Aflalo
,
Gabriela Ben Melech Stan
,
Shachar Rosenman
,
Shao-Yen Tseng
,
Gedas Bertasius
,
Vasudev Lal
MuMUR: Multilingual Multimodal Universal Retrieval.
Inf. Retr. J.
26 (1) (2023)
Avinash Madasu
,
Vasudev Lal
ICSVR: Investigating Compositional and Semantic Understanding in Video Retrieval Models.
CoRR
(2023)
Avinash Madasu
,
Anahita Bhiwandiwalla
,
Vasudev Lal
Analyzing Zero-Shot Abilities of Vision-Language Models on Video Understanding Tasks.
CoRR
(2023)
Gabriela Ben Melech Stan
,
Diana Wofk
,
Scottie Fox
,
Alex Redden
,
Will Saxton
,
Jean Yu
,
Estelle Aflalo
,
Shao-Yen Tseng
,
Fabio Nonato
,
Matthias Müller
,
Vasudev Lal
LDM3D: Latent Diffusion Model for 3D.
CoRR
(2023)
Xiao Xu
,
Bei Li
,
Chenfei Wu
,
Shao-Yen Tseng
,
Anahita Bhiwandiwalla
,
Shachar Rosenman
,
Vasudev Lal
,
Wanxiang Che
,
Nan Duan
ManagerTower: Aggregating the Insights of Uni-Modal Experts for Vision-Language Representation Learning.
CoRR
(2023)
Shachar Rosenman
,
Vasudev Lal
,
Phillip Howard
NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation.
CoRR
(2023)
Phillip Howard
,
Avinash Madasu
,
Tiep Le
,
Gustavo A. Lujan-Moreno
,
Anahita Bhiwandiwalla
,
Vasudev Lal
Probing and Mitigating Intersectional Social Biases in Vision-Language Models with Counterfactual Examples.
CoRR
(2023)
Gabriela Ben Melech Stan
,
Diana Wofk
,
Estelle Aflalo
,
Shao-Yen Tseng
,
Zhipeng Cai
,
Michael Paulitsch
,
Vasudev Lal
LDM3D-VR: Latent Diffusion Model for 3D VR.
CoRR
(2023)
Tiep Le
,
Vasudev Lal
,
Phillip Howard
COCO-Counterfactuals: Automatically Constructed Counterfactual Examples for Image-Text Pairs.
CoRR
(2023)
Jerry Tang
,
Meng Du
,
Vy A. Vo
,
Vasudev Lal
,
Alexander G. Huth
Brain encoding models based on multimodal transformers can transfer across language and vision.
CoRR
(2023)
Avinash Madasu
,
Vasudev Lal
Is Multimodal Vision Supervision Beneficial to Language?
CVPR Workshops
(2023)
Phillip Howard
,
Gadi Singer
,
Vasudev Lal
,
Yejin Choi
,
Swabha Swayamdipta
NeuroCounterfactuals: Beyond Minimal-Edit Counterfactuals for Richer Data Augmentation.
CoRR
(2022)
Ayal Klein
,
Oren Pereg
,
Daniel Korat
,
Vasudev Lal
,
Moshe Wasserblat
,
Ido Dagan
Opinion-based Relational Pivoting for Cross-domain Aspect Term Extraction.
WASSA@ACL
(2022)
Avinash Madasu
,
Estelle Aflalo
,
Gabriela Ben Melech Stan
,
Shao-Yen Tseng
,
Gedas Bertasius
,
Vasudev Lal
Improving video retrieval using multilingual knowledge transfer.
CoRR
(2022)
Phillip Howard
,
Arden Ma
,
Vasudev Lal
,
Ana Paula Simães
,
Daniel Korat
,
Oren Pereg
,
Moshe Wasserblat
,
Gadi Singer
Cross-Domain Aspect Extraction using Transformers Augmented with Knowledge Graphs.
CIKM
(2022)
Estelle Aflalo
,
Meng Du
,
Shao-Yen Tseng
,
Yongfei Liu
,
Chenfei Wu
,
Nan Duan
,
Vasudev Lal
VL-InterpreT: An Interactive Visualization Tool for Interpreting Vision-Language Transformers.
CVPR
(2022)
Phillip Howard
,
Arden Ma
,
Vasudev Lal
,
Ana Paula Simões
,
Daniel Korat
,
Oren Pereg
,
Moshe Wasserblat
,
Gadi Singer
Cross-Domain Aspect Extraction using Transformers Augmented with Knowledge Graphs.
CoRR
(2022)
Xiao Xu
,
Chenfei Wu
,
Shachar Rosenman
,
Vasudev Lal
,
Nan Duan
Bridge-Tower: Building Bridges Between Encoders in Vision-Language Representation Learning.
CoRR
(2022)
Gadi Singer
,
Joscha Bach
,
Tetiana Grinberg
,
Nagib Hakim
,
Phillip R. Howard
,
Vasudev Lal
,
Zev Rivlin
Thrill-K Architecture: Towards a Solution to the Problem of Knowledge Based Understanding.
AGI
(2022)
Estelle Aflalo
,
Meng Du
,
Shao-Yen Tseng
,
Yongfei Liu
,
Chenfei Wu
,
Nan Duan
,
Vasudev Lal
VL-InterpreT: An Interactive Visualization Tool for Interpreting Vision-Language Transformers.
CoRR
(2022)
Phillip Howard
,
Gadi Singer
,
Vasudev Lal
,
Yejin Choi
,
Swabha Swayamdipta
NeuroCounterfactuals: Beyond Minimal-Edit Counterfactuals for Richer Data Augmentation.
EMNLP (Findings)
(2022)
Yongfei Liu
,
Chenfei Wu
,
Shao-Yen Tseng
,
Vasudev Lal
,
Xuming He
,
Nan Duan
KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation.
NAACL-HLT (Findings)
(2022)
Vasudev Lal
,
Somak Aditya
,
Yezhou Yang
,
Pasquale Minervini
,
Sandya Mannarswamy
First Workshop on Knowledge Injection in Neural Networks (KINN).
CIKM
(2021)
Vasudev Lal
,
Arden Ma
,
Estelle Aflalo
,
Phillip Howard
,
Ana Simoes
,
Daniel Korat
,
Oren Pereg
,
Gadi Singer
,
Moshe Wasserblat
InterpreT: An Interactive Visualization Tool for Interpreting Transformers.
EACL (System Demonstrations)
(2021)
Yongfei Liu
,
Chenfei Wu
,
Shao-yen Tseng
,
Vasudev Lal
,
Xuming He
,
Nan Duan
KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation.
CoRR
(2021)