3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment.

Ziyu Zhu, Xiaojian Ma, Yixin Chen, Zhidong Deng, Siyuan Huang, Qing Li
Published in: ICCV (2023)
Keyphrases
  • pre-trained
  • computer vision
  • real time
  • training data
  • vision system
  • training examples
  • text mining