Login / Signup
Mu Cai
Publication Activity (10 Years)
Years Active: 2020-2024
Publications (10 Years): 21
Top Topics
Point Cloud
Frequency Domain
Language Model
Vector Graphics
Top Venues
CoRR
ICCV
ACL (Findings)
ECCV (2)
</>
Publications
</>
Xiang Li
,
Cristina Mata
,
Jongwoo Park
,
Kumara Kahatapitiya
,
Yoo Sung Jang
,
Jinghuan Shang
,
Kanchana Ranasinghe
,
Ryan Burgert
,
Mu Cai
,
Yong Jae Lee
,
Michael S. Ryoo
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy.
CoRR
(2024)
Jianrui Zhang
,
Mu Cai
,
Tengyang Xie
,
Yong Jae Lee
CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual Examples.
CoRR
(2024)
Yuzhang Shang
,
Mu Cai
,
Bingxin Xu
,
Yong Jae Lee
,
Yan Yan
LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models.
CoRR
(2024)
Thao Nguyen
,
Haotian Liu
,
Yuheng Li
,
Mu Cai
,
Utkarsh Ojha
,
Yong Jae Lee
Yo'LLaVA: Your Personalized Language and Vision Assistant.
CoRR
(2024)
Jianrui Zhang
,
Mu Cai
,
Tengyang Xie
,
Yong Jae Lee
CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual Examples.
ACL (Findings)
(2024)
Mu Cai
,
Jianwei Yang
,
Jianfeng Gao
,
Yong Jae Lee
Matryoshka Multimodal Models.
CoRR
(2024)
Bocheng Zou
,
Mu Cai
,
Jianrui Zhang
,
Yong Jae Lee
VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation.
CoRR
(2024)
Yuexiang Zhai
,
Shengbang Tong
,
Xiao Li
,
Mu Cai
,
Qing Qu
,
Yong Jae Lee
,
Yi Ma
Investigating the Catastrophic Forgetting in Multimodal Large Language Models.
CoRR
(2023)
Zeyi Huang
,
Andy Zhou
,
Zijian Lin
,
Mu Cai
,
Haohan Wang
,
Yong Jae Lee
A Sentence Speaks a Thousand Images: Domain Generalization through Distilling CLIP with Language Guidance.
ICCV
(2023)
Mu Cai
,
Yixuan Li
Out-of-distribution Detection via Frequency-regularized Generative Models.
WACV
(2023)
Zeyi Huang
,
Andy Zhou
,
Zijian Lin
,
Mu Cai
,
Haohan Wang
,
Yong Jae Lee
A Sentence Speaks a Thousand Images: Domain Generalization through Distilling CLIP with Language Guidance.
CoRR
(2023)
Mu Cai
,
Haotian Liu
,
Siva Karthik Mustikovela
,
Gregory P. Meyer
,
Yuning Chai
,
Dennis Park
,
Yong Jae Lee
Making Large Multimodal Models Understand Arbitrary Visual Prompts.
CoRR
(2023)
Mu Cai
,
Zeyi Huang
,
Yuheng Li
,
Haohan Wang
,
Yong Jae Lee
Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding.
CoRR
(2023)
Mu Cai
,
Yixuan Li
Out-of-distribution Detection via Frequency-regularized Generative Models.
CoRR
(2022)
Xuefeng Du
,
Zhaoning Wang
,
Mu Cai
,
Yixuan Li
VOS: Learning What You Don't Know by Virtual Outlier Synthesis.
CoRR
(2022)
Xuefeng Du
,
Zhaoning Wang
,
Mu Cai
,
Yixuan Li
VOS: Learning What You Don't Know by Virtual Outlier Synthesis.
ICLR
(2022)
Haotian Liu
,
Mu Cai
,
Yong Jae Lee
Masked Discrimination for Self-Supervised Learning on Point Clouds.
CoRR
(2022)
Haotian Liu
,
Mu Cai
,
Yong Jae Lee
Masked Discrimination for Self-supervised Learning on Point Clouds.
ECCV (2)
(2022)
Mu Cai
,
Hong Zhang
,
Huijuan Huang
,
Qichuan Geng
,
Yixuan Li
,
Gao Huang
Frequency Domain Image Translation: More Photo-realistic, Better Identity-preserving.
ICCV
(2021)
Liting Sun
,
Mu Cai
,
Wei Zhan
,
Masayoshi Tomizuka
A Game-Theoretic Strategy-Aware Interaction Algorithm with Validation on Real Traffic Data.
IROS
(2020)
Mu Cai
,
Hong Zhang
,
Huijuan Huang
,
Qichuan Geng
,
Gao Huang
Frequency Domain Image Translation: More Photo-realistic, Better Identity-preserving.
CoRR
(2020)