WTS: A Pedestrian-Centric Traffic Video Dataset for Fine-grained Spatial-Temporal Understanding.
Quan Kong, Yuki Kawana, Rajat Saini, Ashutosh Kumar, Jingjing Pan, Ta Gu, Yohei Ozao, Balazs Opra, David C. Anastasiu, Yoichi Sato, Norimasa Kobori
Published in: CoRR (2024)
Keyphrases
- fine grained
- spatial temporal
- action recognition
- video dataset
- coarse grained
- human actions
- bag of words
- access control
- activity recognition
- temporal information
- spatial and temporal
- computer vision
- video shots
- spatio temporal
- spatial information
- video search
- image classification
- information retrieval systems
- information extraction
- video clips
- feature space
- video sequences