Login / Signup

Fully-attentive iterative networks for region-based controllable image and video captioning.

Marcella CorniaLorenzo BaraldiAyellet TalRita Cucchiara
Published in: Comput. Vis. Image Underst. (2023)
Keyphrases