Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model.

Published in: CoRR (2023)

Keyphrases