Polos: Multimodal Metric Learning from Human Feedback for Image Captioning.
Yuiga Wada, Kanta Kaneda, Daichi Saito, Komei Sugiura
Published in: CoRR (2024)
Keyphrases
- metric learning
- image classification
- input image
- image content
- image retrieval
- machine learning and pattern recognition
- distance metric learning
- image features
- distance metric
- feature space
- image matching
- learning tasks
- person re-identification
- data sets
- semi-supervised
- relevance feedback
- pairwise
- object recognition
- multi-task
- image segmentation
- machine learning