Sign in

Pseudo 3D Perception Transformer with Multi-level Confidence Optimization for Visual Commonsense Reasoning.

Jian ZhuHanli Wang
Published in: CoRR (2023)
Keyphrases