VisualT5: Multitasking Caption and Concept Prediction with Pre-trained ViT, T5 and Customized Spatial Attention in Radiological Images.

Published in: CLEF (Working Notes) (2024)

Keyphrases