VieCap4H-VLSP 2021: ObjectAoA - Enhancing performance of Object Relation Transformer with Attention on Attention for Vietnamese image captioning.

Published in: CoRR (2022)

Keyphrases