Login / Signup
Multi-scale 2D Temporal Map Diffusion Models for Natural Language Video Localization.
Chongzhi Zhang
Mingyuan Zhang
Zhiyang Teng
Jiayi Li
Xizhou Zhu
Lewei Lu
Ziwei Liu
Aixin Sun
Published in:
CoRR (2024)
Keyphrases
</>
diffusion models
multiscale
natural language
temporal information
information diffusion
video data
diffusion model
video streams
social networks
video sequences
video frames
video content
website
objective function
natural images
diffusion process
medical images
natural language processing
image processing