Perceive, Transform, and Act: Multi-Modal Attention Networks for Vision-and-Language Navigation.

Published in: CoRR (2019)

Keyphrases