Login / Signup
A Joint Modeling of Vision-Language-Action for Target-oriented Grasping in Clutter.
Kechun Xu
Shuqi Zhao
Zhongxiang Zhou
Zizhang Li
Huaijin Pi
Yifeng Zhu
Yue Wang
Rong Xiong
Published in:
CoRR (2023)
Keyphrases
</>
vision system
target detection
modeling language
image processing
computer vision
natural language
database
data sets
knowledge base
human hand
real time
machine learning
programming language
language learning