MS-DETR: Natural Language Video Localization with Sampling Moment-Moment Interaction.
Jing Wang
Aixin Sun
Hao Zhang
Xiaoli Li
Published in:
CoRR (2023)
Keyphrases
natural language
video data
activity detection
machine learning
video sequences
information extraction
real time
video content
user interaction
video frames
video clips
language processing