Enhancing Image-to-Text Generation in Radiology Reports through Cross-modal Multi-Task Learning.
Nurbanu Aksoy, Nishant Ravikumar, Serge Sharoff
Published in: LREC/COLING (2024)
Keyphrases
- multi task learning
- cross modal
- image data
- image classification
- multi task
- image features
- image content
- image retrieval
- multi modal
- text generation
- image representation
- visual similarity
- learning tasks
- machine learning
- gaussian processes
- transfer learning
- image collections
- learning problems
- image regions
- multi class
- similarity measure