STOA-VLP: Spatial-Temporal Modeling of Object and Action for Video-Language Pre-training.
Weihong Zhong
Mao Zheng
Duyu Tang
Xuan Luo
Heng Gong
Xiaocheng Feng
Bing Qin
Published in:
AAAI (2023)
Keyphrases
spatial temporal
human actions
action recognition
video shots
spatial and temporal
spatial and temporal information
space time interest points
spatio temporal
temporal information
video sequences
spatial information
space time
visual features
d objects
human activities
human motion
video database
image segmentation