Sign in

Learning CLIP Guided Visual-Text Fusion Transformer for Video-based Pedestrian Attribute Recognition.

Jun ZhuJiandong JinZihan YangXiaohao WuXiao Wang
Published in: CVPR Workshops (2023)
Keyphrases