Multimodal attention networks for low-level vision-and-language navigation.

Federico Landi, Lorenzo Baraldi, Marcella Cornia, Massimiliano Corsini, Rita Cucchiara
Published in: Comput. Vis. Image Underst. (2021)
Keyphrases