Depth and Video Segmentation Based Visual Attention for Embodied Question Answering.

Published in: IEEE Trans. Pattern Anal. Mach. Intell. (2023)
