Login / Signup
LGM3A@MM
2023
2023
2023
Keyphrases
Publications
2023
Changrong Xiao
,
Sean Xin Xu
,
Kunpeng Zhang
Multimodal Data Augmentation for Image Captioning using Diffusion Models.
LGM3A@MM
(2023)
Federico Rossetto
,
Jeffrey Dalton
,
Roderick Murray-Smith
Generating Multimodal Augmentations with LLMs from Song Metadata for Music Information Retrieval.
LGM3A@MM
(2023)
Qian Yong
,
Jueqi Wei
,
YiRen Zhang
,
XiLun Zhang
,
Chao Wei
,
Simiao Chen
,
Yunhe Li
,
Cheng Ye
,
Bing Huang
,
Hao Wang
CGSMP: Controllable Generative Summarization via Multimodal Prompt.
LGM3A@MM
(2023)
Jing Huang
,
Tianyi Zhang
,
Wei Shi
SAT: Self-Attention Control for Diffusion Models Training.
LGM3A@MM
(2023)
Qianqian Chen
,
Tianyi Zhang
,
Maowen Nie
,
Zheng Wang
,
Shihao Xu
,
Wei Shi
,
Zhao Cao
Fashion-GPT: Integrating LLMs with Fashion Retrieval System.
LGM3A@MM
(2023)
Mingliang Liang
,
Martha A. Larson
Subsampling of Frequent Words in Text for Pre-training a Vision-Language Model.
LGM3A@MM
(2023)
Proceedings of the 1st Workshop on Large Generative Models Meet Multimodal Applications, LGM3A 2023, Ottawa ON, Canada, 2 November 2023
LGM3A@MM
(2023)
Boyang Li
Unlocking Multimedia Capabilities of Gigantic Pretrained Language Models.
LGM3A@MM
(2023)
Ziwei Liu
Multi-Modal Generative AI with Foundation Models.
LGM3A@MM
(2023)
Mike Zheng Shou
Large Generative Models Meet Multimodal Video Intelligence.
LGM3A@MM
(2023)
Tasnim Mohiuddin
,
Tianyi Zhang
,
Maowen Nie
,
Jing Huang
,
Qianqian Chen
,
Wei Shi
ImEW: A Framework for Editing Image in the Wild.
LGM3A@MM
(2023)
Zheng Wang
,
Fei Li
,
Cheng Long
NeurSEG: A Segment Driven Deep Neural Model for Nested Named Entity Recognition.
LGM3A@MM
(2023)