Login / Signup
Multimodal Transformer with Variable-length Memory for Vision-and-Language Navigation.
Chuang Lin
Yi Jiang
Jianfei Cai
Lizhen Qu
Gholamreza Haffari
Zehuan Yuan
Published in:
CoRR (2021)
Keyphrases
</>
variable length
fixed length
statistical dependencies
natural language
n gram
bitstream
text compression
convolutional codes
computer vision
fuzzy logic
run length encoding
vision system
coding scheme
chain code
image processing