Login / Signup

An Intermediate Fusion ViT Enables Efficient Text-Image Alignment in Diffusion Models.

Zizhao HuShaochong JiaMohammad Rostami
Published in: CoRR (2024)
Keyphrases