X4D-SceneFormer: Enhanced Scene Understanding on 4D Point Cloud Videos through Cross-modal Knowledge Transfer.
Linglin JingYing XueXu YanChaoda ZhengDong WangRuimao ZhangZhigang WangHui FangBin ZhaoZhen LiPublished in: CoRR (2023)
Keyphrases
- knowledge transfer
- scene understanding
- cross modal
- video surveillance
- multi modal
- object detection
- object recognition
- knowledge sharing
- d scene
- vision system
- visual data
- transfer learning
- image retrieval
- event detection
- video frames
- learning tasks
- multimedia databases
- video sequences
- background subtraction
- object tracking
- moving objects
- similarity measure
- computer vision
- information retrieval
- data mining
- video data