Incorporating Scene Graphs into Pre-trained Vision-Language Models for Multimodal Open-vocabulary Action Recognition.

Chao Wei, Zhidong Deng
Published in: ICRA (2024)