A Joint Modeling of Vision-Language-Action for Target-oriented Grasping in Clutter.

Kechun Xu, Shuqi Zhao, Zhongxiang Zhou, Zizhang Li, Huaijin Pi, Yifeng Zhu, Yue Wang, Rong Xiong
Published in: CoRR (2023)
Keyphrases
  • vision system
  • target detection
  • modeling language
  • image processing
  • computer vision
  • natural language
  • database
  • data sets
  • knowledge base
  • human hand
  • real time
  • machine learning
  • programming language
  • language learning