Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model.

Published in: ACM Multimedia (2023)

Keyphrases