M3T: Multi-Modal Medical Transformer to bridge Clinical Context with Visual Insights for Retinal Image Medical Description Generation.
Nagur Shareef ShaikTeja Krishna CherukuriDong Hye YePublished in: CoRR (2024)
Keyphrases
- multi modal
- medical domain
- cross modal
- medical experts
- medical data
- video search
- retinal images
- medical diagnosis
- high dimensional
- multi modality
- medical records
- medical knowledge
- diabetic retinopathy
- high level
- single modality
- audio visual
- graph cuts
- auto annotation
- clinical data
- clinical practice
- visual information
- segmentation method
- low level
- optic disc