Login / Signup
MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models.
Jiuming Liu
Jinru Han
Lihao Liu
Angelica I. Avilés-Rivero
Chaokang Jiang
Zhe Liu
Hesheng Wang
Published in:
CoRR (2024)
Keyphrases
</>
spatial temporal
point cloud
state space
video shots
spatial and temporal
structure from motion
spatio temporal
temporal information
video content
human actions
reinforcement learning
action recognition
spatial information
multimedia
video sequences
video data
video frames
video retrieval
image sequences