Learning CLIP Guided Visual-Text Fusion Transformer for Video-based Pedestrian Attribute Recognition.
Jun Zhu
Jiandong Jin
Zihan Yang
Xiaohao Wu
Xiao Wang
Published in:
CoRR (2023)
Keyphrases
visual learning
visual recognition
learning process
unsupervised learning
reinforcement learning
object recognition
object detection
supervised learning
online learning
learning systems
recognition accuracy
visual processing
information retrieval
video sequences
fuzzy logic