VisualT5: Multitasking Caption and Concept Prediction with Pre-trained ViT, T5 and Customized Spatial Attention in Radiological Images.
Diedre Carmo
Letícia Rittner
Roberto de Alencar Lotufo
Published in:
CLEF (Working Notes) (2024)
Keyphrases
mean shift
pre-trained
image database
image classification
image features
image retrieval
image registration
three dimensional
small number
input image
object recognition
feature points
training data
viewpoint
training set
training examples
spatial and temporal